SEQUENCE LISTING 

<110> Kock, Michael 
Frank, Markus 
Badur, Ralf 

<120> Novel selection Processes 

<130> 532622010400 

<140> To be assigned 
<141> 2005-01-24 

<160> 71 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 1284 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1)..(1281)

<223> coding for cytosine deaminase (codA) 

<400> 1 

gtg tcg aat aac gct tta caa aca att att aac gcc cgg tta cca ggc 48
Val Ser Asn Asn Ala Leu Gln Thr Ile Ile Asn Ala Arg Leu Pro Gly
1 5 10 15

gaa gag ggg ctg tgg cag att cat ctg cag gac gga aaa atc agc gcc 96
Glu Glu Gly Leu Trp Gln Ile His Leu Gln Asp Gly Lys Ile Ser Ala
20 25 30

att gat gcg caa tcc ggc gtg atg ccc ata act gaa aac agc ctg gat 144
Ile Asp Ala Gln Ser Gly Val Met Pro Ile Thr Glu Asn Ser Leu Asp
35 40 45

gcc gaa caa ggt tta gtt ata ccg ccg ttt gtg gag cca cat att cac 192
Ala Glu Gln Gly Leu Val Ile Pro Pro Phe Val Glu Pro His Ile His
50 55 60

ctg gac acc acg caa acc gcc gga caa ccg aac tgg aat cag tcc ggc 240
Leu Asp Thr Thr Gln Thr Ala Gly Gln Pro Asn Trp Asn Gln Ser Gly
65 70 75 80

acg ctg ttt gaa ggc att gaa cgc tgg gcc gag cgc aaa gcg tta tta 288
Thr Leu Phe Glu Gly Ile Glu Arg Trp Ala Glu Arg Lys Ala Leu Leu
85 90 95

acc cat gac gat gtg aaa caa cgc gca tgg caa acg ctg aaa tgg cag 336
Thr His Asp Asp Val Lys Gln Arg Ala Trp Gln Thr Leu Lys Trp Gln
100 105 110

att gcc aac ggc att cag cat gtg cgt acc cat gtc gat gtt tcg gat 384
Ile Ala Asn Gly Ile Gln His Val Arg Thr His Val Asp Val Ser Asp
115 120 125

gca acg cta act gcg ctg aaa gca atg ctg gaa gtg aag cag gaa gtc 432
Ala Thr Leu Thr Ala Leu Lys Ala Met Leu Glu Val Lys Gln Glu Val
130 135 140

gcg ccg tgg att gat ctg caa atc gtc gcc ttc cct cag gaa ggg att 480
Ala Pro Trp Ile Asp Leu Gln Ile Val Ala Phe Pro Gln Glu Gly Ile
145 150 155 160

ttg tcg tat ccc aac ggt gaa gcg ttg ctg gaa gag gcg tta cgc tta 528
Leu Ser Tyr Pro Asn Gly Glu Ala Leu Leu Glu Glu Ala Leu Arg Leu
165 170 175

ggg gca gat gta gtg ggg gcg att ccg cat ttt gaa ttt acc cgt gaa 576
Gly Ala Asp Val Val Gly Ala Ile Pro His Phe Glu Phe Thr Arg Glu
180 185 190

tac ggc gtg gag tcg ctg cat aaa acc ttc gcc ctg gcg caa aaa tac 624
Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gln Lys Tyr
195 200 205

gac cgt ctc atc gac gtt cac tgt gat gag atc gat gac gag cag tcg 672
Asp Arg Leu Ile Asp Val His Cys Asp Glu Ile Asp Asp Glu Gln Ser
210 215 220

cgc ttt gtc gaa acc gtt gct gcc ctg gcg cac cat gaa ggc atg ggc 720
Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly
225 230 235 240

gcg cga gtc acc gcc agc cac acc acg gca atg cac tcc tat aac ggg 768
Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly
245 250 255

gcg tat acc tca cgc ctg ttc cgc ttg ctg aaa atg tcc ggt att aac 816
Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly Ile Asn
260 265 270

ttt gtc gcc aac ccg ctg gtc aat att cat ctg caa gga cgt ttc gat 864
Phe Val Ala Asn Pro Leu Val Asn Ile His Leu Gln Gly Arg Phe Asp
275 280 285

acg tat cca aaa cgt cgc ggc atc acg cgc gtt aaa gag atg ctg gag 912
Thr Tyr Pro Lys Arg Arg Gly Ile Thr Arg Val Lys Glu Met Leu Glu
290 295 300

tcc ggc att aac gtc tgc ttt ggt cac gat gat gtc ttc gat ccg tgg 960
Ser Gly Ile Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp
305 310 315 320

tat ccg ctg gga acg gcg aat atg ctg caa gtg ctg cat atg ggg ctg 1008
Tyr Pro Leu Gly Thr Ala Asn Met Leu Gln Val Leu His Met Gly Leu
325 330 335

cat gtt tgc cag ttg atg ggc tac ggg cag att aac gat ggc ctg aat 1056
His Val Cys Gln Leu Met Gly Tyr Gly Gln Ile Asn Asp Gly Leu Asn
340 345 350

tta atc acc cac cac agc gca agg acg ttg aat ttg cag gat tac ggc 1104
Leu Ile Thr His His Ser Ala Arg Thr Leu Asn Leu Gln Asp Tyr Gly
355 360 365

att gcc gcc gga aac agc gcc aac ctg att atc ctg ccg gct gaa aat 1152
Ile Ala Ala Gly Asn Ser Ala Asn Leu Ile Ile Leu Pro Ala Glu Asn
370 375 380

ggg ttt gat gcg ctg cgc cgt cag gtt ccg gta cgt tat tcg gta cgt 1200
Gly Phe Asp Ala Leu Arg Arg Gln Val Pro Val Arg Tyr Ser Val Arg
385 390 395 400

ggc ggc aag gtg att gcc agc aca caa ccg gca caa acc acc gta tat 1248
Gly Gly Lys Val Ile Ala Ser Thr Gln Pro Ala Gln Thr Thr Val Tyr
405 410 415

ctg gag cag cca gaa gcc atc gat tac aaa cgt tga 1284
Leu Glu Gln Pro Glu Ala Ile Asp Tyr Lys Arg
420 425

<210> 2 
<211> 427 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Val Ser Asn Asn Ala Leu Gln Thr Ile Ile Asn Ala Arg Leu Pro Gly
1 5 10 15

Glu Glu Gly Leu Trp Gln Ile His Leu Gln Asp Gly Lys Ile Ser Ala
20 25 30

Ile Asp Ala Gln Ser Gly Val Met Pro Ile Thr Glu Asn Ser Leu Asp
35 40 45

Ala Glu Gln Gly Leu Val Ile Pro Pro Phe Val Glu Pro His Ile His
50 55 60

Leu Asp Thr Thr Gln Thr Ala Gly Gln Pro Asn Trp Asn Gln Ser Gly
65 70 75 80

Thr Leu Phe Glu Gly Ile Glu Arg Trp Ala Glu Arg Lys Ala Leu Leu
85 90 95

Thr His Asp Asp Val Lys Gln Arg Ala Trp Gln Thr Leu Lys Trp Gln
100 105 110

Ile Ala Asn Gly Ile Gln His Val Arg Thr His Val Asp Val Ser Asp
115 120 125

Ala Thr Leu Thr Ala Leu Lys Ala Met Leu Glu Val Lys Gln Glu Val
130 135 140

Ala Pro Trp Ile Asp Leu Gln Ile Val Ala Phe Pro Gln Glu Gly Ile
145 150 155 160

Leu Ser Tyr Pro Asn Gly Glu Ala Leu Leu Glu Glu Ala Leu Arg Leu
165 170 175

Gly Ala Asp Val Val Gly Ala Ile Pro His Phe Glu Phe Thr Arg Glu
180 185 190

Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gln Lys Tyr
195 200 205

Asp Arg Leu Ile Asp Val His Cys Asp Glu Ile Asp Asp Glu Gln Ser
210 215 220

Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly
225 230 235 240

Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly
245 250 255

Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly Ile Asn
260 265 270

Phe Val Ala Asn Pro Leu Val Asn Ile His Leu Gln Gly Arg Phe Asp
275 280 285

Thr Tyr Pro Lys Arg Arg Gly Ile Thr Arg Val Lys Glu Met Leu Glu
290 295 300

Ser Gly Ile Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp
305 310 315 320

Tyr Pro Leu Gly Thr Ala Asn Met Leu Gln Val Leu His Met Gly Leu
325 330 335

His Val Cys Gln Leu Met Gly Tyr Gly Gln Ile Asn Asp Gly Leu Asn
340 345 350

Leu Ile Thr His His Ser Ala Arg Thr Leu Asn Leu Gln Asp Tyr Gly
355 360 365

Ile Ala Ala Gly Asn Ser Ala Asn Leu Ile Ile Leu Pro Ala Glu Asn
370 375 380

Gly Phe Asp Ala Leu Arg Arg Gln Val Pro Val Arg Tyr Ser Val Arg
385 390 395 400

Gly Gly Lys Val Ile Ala Ser Thr Gln Pro Ala Gln Thr Thr Val Tyr
405 410 415

Leu Glu Gln Pro Glu Ala Ile Asp Tyr Lys Arg
420 425

<210> 3 
<211> 1284 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: coding for 
cytosine deaminase (codA) 

<220> 

<221> misc_feature 

<222> (1)..(3)

<223> mutation of GTG to ATG start codon for expression 
in eukaryotic hosts 

<220> 

<221> CDS 

<222> (1)..(1281) 

<223> coding for cytosine deaminase (codA) 
<400> 3 

atg tcg aat aac gct tta caa aca att att aac gcc cgg tta cca ggc 48
Met Ser Asn Asn Ala Leu Gln Thr Ile Ile Asn Ala Arg Leu Pro Gly
1 5 10 15

gaa gag ggg ctg tgg cag att cat ctg cag gac gga aaa atc agc gcc 96
Glu Glu Gly Leu Trp Gln Ile His Leu Gln Asp Gly Lys Ile Ser Ala
20 25 30

att gat gcg caa tcc ggc gtg atg ccc ata act gaa aac agc ctg gat 144
Ile Asp Ala Gln Ser Gly Val Met Pro Ile Thr Glu Asn Ser Leu Asp
35 40 45

gcc gaa caa ggt tta gtt ata ccg ccg ttt gtg gag cca cat att cac 192
Ala Glu Gln Gly Leu Val Ile Pro Pro Phe Val Glu Pro His Ile His
50 55 60

ctg gac acc acg caa acc gcc gga caa ccg aac tgg aat cag tcc ggc 240
Leu Asp Thr Thr Gln Thr Ala Gly Gln Pro Asn Trp Asn Gln Ser Gly
65 70 75 80

acg ctg ttt gaa ggc att gaa cgc tgg gcc gag cgc aaa gcg tta tta 288
Thr Leu Phe Glu Gly Ile Glu Arg Trp Ala Glu Arg Lys Ala Leu Leu
85 90 95

acc cat gac gat gtg aaa caa cgc gca tgg caa acg ctg aaa tgg cag 336
Thr His Asp Asp Val Lys Gln Arg Ala Trp Gln Thr Leu Lys Trp Gln
100 105 110

att gcc aac ggc att cag cat gtg cgt acc cat gtc gat gtt tcg gat 384
Ile Ala Asn Gly Ile Gln His Val Arg Thr His Val Asp Val Ser Asp
115 120 125

gca acg cta act gcg ctg aaa gca atg ctg gaa gtg aag cag gaa gtc 432
Ala Thr Leu Thr Ala Leu Lys Ala Met Leu Glu Val Lys Gln Glu Val
130 135 140

gcg ccg tgg att gat ctg caa atc gtc gcc ttc cct cag gaa ggg att 480
Ala Pro Trp Ile Asp Leu Gln Ile Val Ala Phe Pro Gln Glu Gly Ile
145 150 155 160

ttg tcg tat ccc aac ggt gaa gcg ttg ctg gaa gag gcg tta cgc tta 528
Leu Ser Tyr Pro Asn Gly Glu Ala Leu Leu Glu Glu Ala Leu Arg Leu
165 170 175

ggg gca gat gta gtg ggg gcg att ccg cat ttt gaa ttt acc cgt gaa 576
Gly Ala Asp Val Val Gly Ala Ile Pro His Phe Glu Phe Thr Arg Glu
180 185 190

tac ggc gtg gag tcg ctg cat aaa acc ttc gcc ctg gcg caa aaa tac 624
Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gln Lys Tyr
195 200 205

gac cgt ctc atc gac gtt cac tgt gat gag atc gat gac gag cag tcg 672
Asp Arg Leu Ile Asp Val His Cys Asp Glu Ile Asp Asp Glu Gln Ser
210 215 220

cgc ttt gtc gaa acc gtt gct gcc ctg gcg cac cat gaa ggc atg ggc 720
Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly
225 230 235 240

gcg cga gtc acc gcc agc cac acc acg gca atg cac tcc tat aac ggg 768
Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly
245 250 255

gcg tat acc tca cgc ctg ttc cgc ttg ctg aaa atg tcc ggt att aac 816
Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly Ile Asn
260 265 270

ttt gtc gcc aac ccg ctg gtc aat att cat ctg caa gga cgt ttc gat 864
Phe Val Ala Asn Pro Leu Val Asn Ile His Leu Gln Gly Arg Phe Asp
275 280 285

acg tat cca aaa cgt cgc ggc atc acg cgc gtt aaa gag atg ctg gag 912
Thr Tyr Pro Lys Arg Arg Gly Ile Thr Arg Val Lys Glu Met Leu Glu
290 295 300

tcc ggc att aac gtc tgc ttt ggt cac gat gat gtc ttc gat ccg tgg 960
Ser Gly Ile Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp
305 310 315 320

tat ccg ctg gga acg gcg aat atg ctg caa gtg ctg cat atg ggg ctg 1008
Tyr Pro Leu Gly Thr Ala Asn Met Leu Gln Val Leu His Met Gly Leu
325 330 335

cat gtt tgc cag ttg atg ggc tac ggg cag att aac gat ggc ctg aat 1056
His Val Cys Gln Leu Met Gly Tyr Gly Gln Ile Asn Asp Gly Leu Asn
340 345 350

tta atc acc cac cac agc gca agg acg ttg aat ttg cag gat tac ggc 1104
Leu Ile Thr His His Ser Ala Arg Thr Leu Asn Leu Gln Asp Tyr Gly
355 360 365

att gcc gcc gga aac agc gcc aac ctg att atc ctg ccg gct gaa aat 1152
Ile Ala Ala Gly Asn Ser Ala Asn Leu Ile Ile Leu Pro Ala Glu Asn
370 375 380

ggg ttt gat gcg ctg cgc cgt cag gtt ccg gta cgt tat tcg gta cgt 1200
Gly Phe Asp Ala Leu Arg Arg Gln Val Pro Val Arg Tyr Ser Val Arg
385 390 395 400

ggc ggc aag gtg att gcc agc aca caa ccg gca caa acc acc gta tat 1248
Gly Gly Lys Val Ile Ala Ser Thr Gln Pro Ala Gln Thr Thr Val Tyr
405 410 415

ctg gag cag cca gaa gcc atc gat tac aaa cgt tga 1284
Leu Glu Gln Pro Glu Ala Ile Asp Tyr Lys Arg
420 425

<210> 4 
<211> 427 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: coding for 
cytosine deaminase (codA) 

<400> 4 

Met Ser Asn Asn Ala Leu Gln Thr Ile Ile Asn Ala Arg Leu Pro Gly
1 5 10 15

Glu Glu Gly Leu Trp Gln Ile His Leu Gln Asp Gly Lys Ile Ser Ala
20 25 30

Ile Asp Ala Gln Ser Gly Val Met Pro Ile Thr Glu Asn Ser Leu Asp
35 40 45

Ala Glu Gln Gly Leu Val Ile Pro Pro Phe Val Glu Pro His Ile His
50 55 60

Leu Asp Thr Thr Gln Thr Ala Gly Gln Pro Asn Trp Asn Gln Ser Gly
65 70 75 80

Thr Leu Phe Glu Gly Ile Glu Arg Trp Ala Glu Arg Lys Ala Leu Leu
85 90 95

Thr His Asp Asp Val Lys Gln Arg Ala Trp Gln Thr Leu Lys Trp Gln
100 105 110

Ile Ala Asn Gly Ile Gln His Val Arg Thr His Val Asp Val Ser Asp
115 120 125

Ala Thr Leu Thr Ala Leu Lys Ala Met Leu Glu Val Lys Gln Glu Val
130 135 140

Ala Pro Trp Ile Asp Leu Gln Ile Val Ala Phe Pro Gln Glu Gly Ile
145 150 155 160

Leu Ser Tyr Pro Asn Gly Glu Ala Leu Leu Glu Glu Ala Leu Arg Leu
165 170 175

Gly Ala Asp Val Val Gly Ala Ile Pro His Phe Glu Phe Thr Arg Glu
180 185 190

Tyr Gly Val Glu Ser Leu His Lys Thr Phe Ala Leu Ala Gln Lys Tyr
195 200 205

Asp Arg Leu Ile Asp Val His Cys Asp Glu Ile Asp Asp Glu Gln Ser
210 215 220

Arg Phe Val Glu Thr Val Ala Ala Leu Ala His His Glu Gly Met Gly
225 230 235 240

Ala Arg Val Thr Ala Ser His Thr Thr Ala Met His Ser Tyr Asn Gly
245 250 255

Ala Tyr Thr Ser Arg Leu Phe Arg Leu Leu Lys Met Ser Gly Ile Asn
260 265 270

Phe Val Ala Asn Pro Leu Val Asn Ile His Leu Gln Gly Arg Phe Asp
275 280 285

Thr Tyr Pro Lys Arg Arg Gly Ile Thr Arg Val Lys Glu Met Leu Glu
290 295 300

Ser Gly Ile Asn Val Cys Phe Gly His Asp Asp Val Phe Asp Pro Trp
305 310 315 320

Tyr Pro Leu Gly Thr Ala Asn Met Leu Gln Val Leu His Met Gly Leu
325 330 335

His Val Cys Gln Leu Met Gly Tyr Gly Gln Ile Asn Asp Gly Leu Asn
340 345 350

Leu Ile Thr His His Ser Ala Arg Thr Leu Asn Leu Gln Asp Tyr Gly
355 360 365

Ile Ala Ala Gly Asn Ser Ala Asn Leu Ile Ile Leu Pro Ala Glu Asn
370 375 380

Gly Phe Asp Ala Leu Arg Arg Gln Val Pro Val Arg Tyr Ser Val Arg
385 390 395 400

Gly Gly Lys Val Ile Ala Ser Thr Gln Pro Ala Gln Thr Thr Val Tyr
405 410 415

Leu Glu Gln Pro Glu Ala Ile Asp Tyr Lys Arg
420 425

<210> 5 
<211> 1221 
<212> DNA 
<213> Streptomyces griseolus

<220> 

<221> CDS 

<222> (1)..(1218) 

<223> coding for cytochrome P450-SU1 (suaC)
<400> 5 

atg acc gat acc gcc acg acg ccc cag acc acg gac gca ccc gcc ttc 48
Met Thr Asp Thr Ala Thr Thr Pro Gln Thr Thr Asp Ala Pro Ala Phe
1 5 10 15

ccg agc aac cgg agc tgt ccc tac cag tta ccg gac ggc tac gcc cag 96
Pro Ser Asn Arg Ser Cys Pro Tyr Gln Leu Pro Asp Gly Tyr Ala Gln
20 25 30

ctc cgg gac acc ccc ggc ccc ctg cac cgg gtg acg ctc tac gac ggc 144
Leu Arg Asp Thr Pro Gly Pro Leu His Arg Val Thr Leu Tyr Asp Gly
35 40 45

cgt cag gcg tgg gtg gtg acc aag cac gag gcc gcg cgc aaa ctg ctc 192
Arg Gln Ala Trp Val Val Thr Lys His Glu Ala Ala Arg Lys Leu Leu
50 55 60

ggc gac ccc cgg ctg tcc tcc aac cgg acg gac gac aac ttc ccc gcc 240
Gly Asp Pro Arg Leu Ser Ser Asn Arg Thr Asp Asp Asn Phe Pro Ala
65 70 75 80

acg tca ccg cgc ttc gag gcc gtc cgg gag agc ccg cag gcg ttc atc 288
Thr Ser Pro Arg Phe Glu Ala Val Arg Glu Ser Pro Gln Ala Phe Ile
85 90 95

ggc ctg gac ccg ccc gag cac ggc acc cgg cgg cgg atg acg atc agc 336
Gly Leu Asp Pro Pro Glu His Gly Thr Arg Arg Arg Met Thr Ile Ser
100 105 110

gag ttc acc gtc aag cgg atc aag ggc atg cgc ccc gag gtc gag gag 384
Glu Phe Thr Val Lys Arg Ile Lys Gly Met Arg Pro Glu Val Glu Glu
115 120 125

gtg gtg cac ggc ttc ctc gac gag atg ctg gcc gcc ggc ccg acc gcc 432
Val Val His Gly Phe Leu Asp Glu Met Leu Ala Ala Gly Pro Thr Ala
130 135 140

gac ctg gtc agt cag ttc gcg ctg ccg gtg ccc tcc atg gtg atc tgc 480
Asp Leu Val Ser Gln Phe Ala Leu Pro Val Pro Ser Met Val Ile Cys
145 150 155 160

cga ctc ctc ggc gtg ccc tac gcc gac cac gag ttc ttc cag gac gcg 528
Arg Leu Leu Gly Val Pro Tyr Ala Asp His Glu Phe Phe Gln Asp Ala
165 170 175

agc aag cgg ctg gtg cag tcc acg gac gcg cag agc gcg ctc acc gcg 576
Ser Lys Arg Leu Val Gln Ser Thr Asp Ala Gln Ser Ala Leu Thr Ala
180 185 190

cgg aac gac ctc gcg ggt tac ctg gac ggc ctc atc acc cag ttc cag 624
Arg Asn Asp Leu Ala Gly Tyr Leu Asp Gly Leu Ile Thr Gln Phe Gln
195 200 205

acc gaa ccg ggc gcg ggc ctg gtg ggc gct ctg gtc gcc gac cag ctg 672
Thr Glu Pro Gly Ala Gly Leu Val Gly Ala Leu Val Ala Asp Gln Leu
210 215 220

gcc aac ggc gag atc gac cgt gag gaa ctg atc tcc acc gcg atg ctg 720
Ala Asn Gly Glu Ile Asp Arg Glu Glu Leu Ile Ser Thr Ala Met Leu
225 230 235 240

ctc ctc atc gcc ggc cac gag acc acg gcc tcg atg acc tcc ctc agc 768
Leu Leu Ile Ala Gly His Glu Thr Thr Ala Ser Met Thr Ser Leu Ser
245 250 255

gtg atc acc ctg ctg gac cac ccc gag cag tac gcc gcc ctg cgc gcc 816
Val Ile Thr Leu Leu Asp His Pro Glu Gln Tyr Ala Ala Leu Arg Ala
260 265 270

gac cgc agc ctc gtg ccc ggc gcg gtg gag gaa ctg ctc cgc tac ctc 864
Asp Arg Ser Leu Val Pro Gly Ala Val Glu Glu Leu Leu Arg Tyr Leu
275 280 285

gcc atc gcc gac atc gcg ggc ggc cgc gtc gcc acg gcg gac atc gag 912
Ala Ile Ala Asp Ile Ala Gly Gly Arg Val Ala Thr Ala Asp Ile Glu
290 295 300

gtc gag ggg cac ctc atc cgg gcc ggc gag ggc gtg atc gtc gtc aac 960
Val Glu Gly His Leu Ile Arg Ala Gly Glu Gly Val Ile Val Val Asn
305 310 315 320

tcg ata gcc aac cgg gac ggc acg gtg tac gag gac ccg gac gcc ctc 1008
Ser Ile Ala Asn Arg Asp Gly Thr Val Tyr Glu Asp Pro Asp Ala Leu
325 330 335

gac atc cac cgc tcc gcg cgc cac cac ctc gcc ttc ggc ttc ggc gtg 1056
Asp Ile His Arg Ser Ala Arg His His Leu Ala Phe Gly Phe Gly Val
340 345 350

cac cag tgc ctg ggc cag aac ctc gcc cgg ctg gag ctg gag gtc atc 1104
His Gln Cys Leu Gly Gln Asn Leu Ala Arg Leu Glu Leu Glu Val Ile
355 360 365

ctc aac gcc ctc atg gac cgc gtc ccg acg ctg cga ctg gcc gtc ccc 1152
Leu Asn Ala Leu Met Asp Arg Val Pro Thr Leu Arg Leu Ala Val Pro
370 375 380

gtc gag cag ttg gtg ctg cgg ccg ggt acg acg atc cag ggc gtc aac 1200
Val Glu Gln Leu Val Leu Arg Pro Gly Thr Thr Ile Gln Gly Val Asn
385 390 395 400

gaa ctc ccg gtc acc tgg tga 1221
Glu Leu Pro Val Thr Trp
405

<210> 6 
<211> 406 
<212> PRT 

<213> Streptomyces griseolus 
<400> 6 

Met Thr Asp Thr Ala Thr Thr Pro Gln Thr Thr Asp Ala Pro Ala Phe
1 5 10 15

Pro Ser Asn Arg Ser Cys Pro Tyr Gln Leu Pro Asp Gly Tyr Ala Gln
20 25 30

Leu Arg Asp Thr Pro Gly Pro Leu His Arg Val Thr Leu Tyr Asp Gly
35 40 45

Arg Gln Ala Trp Val Val Thr Lys His Glu Ala Ala Arg Lys Leu Leu
50 55 60

Gly Asp Pro Arg Leu Ser Ser Asn Arg Thr Asp Asp Asn Phe Pro Ala
65 70 75 80

Thr Ser Pro Arg Phe Glu Ala Val Arg Glu Ser Pro Gln Ala Phe Ile
85 90 95

Gly Leu Asp Pro Pro Glu His Gly Thr Arg Arg Arg Met Thr Ile Ser
100 105 110

Glu Phe Thr Val Lys Arg Ile Lys Gly Met Arg Pro Glu Val Glu Glu
115 120 125

Val Val His Gly Phe Leu Asp Glu Met Leu Ala Ala Gly Pro Thr Ala
130 135 140

Asp Leu Val Ser Gln Phe Ala Leu Pro Val Pro Ser Met Val Ile Cys
145 150 155 160

Arg Leu Leu Gly Val Pro Tyr Ala Asp His Glu Phe Phe Gln Asp Ala
165 170 175

Ser Lys Arg Leu Val Gln Ser Thr Asp Ala Gln Ser Ala Leu Thr Ala
180 185 190

Arg Asn Asp Leu Ala Gly Tyr Leu Asp Gly Leu Ile Thr Gln Phe Gln
195 200 205

Thr Glu Pro Gly Ala Gly Leu Val Gly Ala Leu Val Ala Asp Gln Leu
210 215 220

Ala Asn Gly Glu Ile Asp Arg Glu Glu Leu Ile Ser Thr Ala Met Leu
225 230 235 240

Leu Leu Ile Ala Gly His Glu Thr Thr Ala Ser Met Thr Ser Leu Ser
245 250 255

Val Ile Thr Leu Leu Asp His Pro Glu Gln Tyr Ala Ala Leu Arg Ala
260 265 270

Asp Arg Ser Leu Val Pro Gly Ala Val Glu Glu Leu Leu Arg Tyr Leu
275 280 285

Ala Ile Ala Asp Ile Ala Gly Gly Arg Val Ala Thr Ala Asp Ile Glu
290 295 300

Val Glu Gly His Leu Ile Arg Ala Gly Glu Gly Val Ile Val Val Asn
305 310 315 320

Ser Ile Ala Asn Arg Asp Gly Thr Val Tyr Glu Asp Pro Asp Ala Leu
325 330 335

Asp Ile His Arg Ser Ala Arg His His Leu Ala Phe Gly Phe Gly Val
340 345 350

His Gln Cys Leu Gly Gln Asn Leu Ala Arg Leu Glu Leu Glu Val Ile
355 360 365

Leu Asn Ala Leu Met Asp Arg Val Pro Thr Leu Arg Leu Ala Val Pro
370 375 380

Val Glu Gln Leu Val Leu Arg Pro Gly Thr Thr Ile Gln Gly Val Asn
385 390 395 400

Glu Leu Pro Val Thr Trp
405

<210> 7 
<211> 1404 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 

<222> (1)..(1401)

<223> coding for indoleacetamide hydrolase (tms2) 
<400> 7 

atg gtg ccc att acc tcg tta gca caa acc cta gaa cgc ctg aga cgg 48
Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

aaa gac tac tcc tgc tta gaa cta gta gaa act ctg ata gcg cgt tgc 96
Lys Asp Tyr Ser Cys Leu Glu Leu Val Glu Thr Leu Ile Ala Arg Cys
20 25 30

caa gct gca aaa cca tta aat gcc ctt ctg gct aca gac tgg gat ggc 144
Gln Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly
35 40 45

ttg cgg cga agc gcc aaa aaa att gat cgt cat gga aac gcc gga tta 192
Leu Arg Arg Ser Ala Lys Lys Ile Asp Arg His Gly Asn Ala Gly Leu
50 55 60

ggt ctt tgc ggc att cca ctc tgt ttt aag gcg aac atc gcg acc ggc 240
Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

ata ttt cct aca agc gct gct act ccg gcg ctg ata aac cac ttg cca 288
Ile Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu Ile Asn His Leu Pro
85 90 95

aag ata cca tcc cgc gtc gca gaa aga ctt ttt tca gct gga gca ctg 336
Lys Ile Pro Ser Arg Val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu
100 105 110

ccg ggt gcc tcg gga aac atg cat gag tta tcg ttt gga att acg agc 384
Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125



[...] cgg aac ccg tgg [...] 432
[...] Arg Asn Pro Trp [...]
130 135 140

ata cca gga ggc tca agc ggt [...] gtg gct gct gcg [...] 480
Ile Pro Gly Gly Ser Ser Gly [...] Val Ala Ala Ala [...]
145 150 155 160

ttg atg tta ggc ggc ata ggc [...] gat acc ggt gca [...] 528
Leu Met Leu Gly Gly Ile Gly [...] Asp Thr Gly Ala [...]
165 170 175



ccc gca gcc ctg tgt ggc gta gta gga ttt cga ccg acg ctt gct cga 576
Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Leu Ala Arg
180 185 190

tat cca aga gat cgg ata ata ccg gtc agc ccc acc cgg gac acc gcc 624
Tyr Pro Arg Asp Arg Ile Ile Pro Val Ser Pro Thr Arg Asp Thr Ala
195 200 205

gga atc ata gcg cag tgc gta gcc gat gtt ata atc ctc gac cag gtg 672
Gly Ile Ile Ala Gln Cys Val Ala Asp Val Ile Ile Leu Asp Gln Val
210 215 220

att tcc gga cgg tcg gcg aaa att tca ccc atg ccg ctg aag ggg ctt 720
Ile Ser Gly Arg Ser Ala Lys Ile Ser Pro Met Pro Leu Lys Gly Leu
225 230 235 240

cgg atc ggc ctc ccc act acc tac ttt tac gat gac ctt gat gct gat 768
Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

gtg gcc ttc gca gct gaa acg acg att cgc ttg cta gcc aac aga ggc 816
Val Ala Phe Ala Ala Glu Thr Thr Ile Arg Leu Leu Ala Asn Arg Gly
260 265 270

gta acc ttt gtt gaa gcc gac atc ccc cac cta gag gaa ctg aat agt 864
Val Thr Phe Val Glu Ala Asp Ile Pro His Leu Glu Glu Leu Asn Ser
275 280 285

ggg gca agt ttg cca att gcg ctt tac gaa ttt cca cac gct cta aaa 912
Gly Ala Ser Leu Pro Ile Ala Leu Tyr Glu Phe Pro His Ala Leu Lys
290 295 300

aag tat ctc gac gat ttt gtg gga aca gtt tct ttt tct gac gtt atc 960
Lys Tyr Leu Asp Asp Phe Val Gly Thr Val Ser Phe Ser Asp Val Ile
305 310 315 320

aaa gga att cgt agc ccc gat gta gcg aac att gtc agt gcg caa att 1008
Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Val Ser Ala Gln Ile
325 330 335

gat ggg cat caa att tcc aac gat gaa tat gaa ctg gcg cgt caa tcc 1056
Asp Gly His Gln Ile Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gln Ser
340 345 350

ttc agg cca agg ctc cag gcc act tat cgg aat tac ttc aga ctc tat 1104
Phe Arg Pro Arg Leu Gln Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr
355 360 365

cag tta gat gca atc ctt ttc cca act gca ccc tta gcg gcc aaa gcc 1152
Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala
370 375 380

ata ggt cag gag tcg tca gtc atc cac aat ggc tca atg atg aac act 1200
Ile Gly Gln Glu Ser Ser Val Ile His Asn Gly Ser Met Met Asn Thr
385 390 395 400

ttc aag atc tac gtg cga aat gtg gac cca agc agc aac gca ggc cta 1248
Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

cct ggg ttg agc ctt cct gcc tgc ctt aca cct gat cgc ttg cct gtt 1296
Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro Val
420 425 430

gga atg gaa att gat gga tta gcg ggg tca gac cac cgt ctg tta gca 1344
Gly Met Glu Ile Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala
435 440 445

atc ggg gca gca tta gaa aaa gcc ata aat ttt cct tcc ttt ccc gat 1392
Ile Gly Ala Ala Leu Glu Lys Ala Ile Asn Phe Pro Ser Phe Pro Asp
450 455 460

gct ttt aat tag 1404
Ala Phe Asn
465

<210> 8 
<211> 467 
<212> PRT 

<213> Agrobacterium tumefaciens 
<400> 8 

Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

Lys Asp Tyr Ser Cys Leu Glu Leu Val Glu Thr Leu Ile Ala Arg Cys
20 25 30

Gln Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly
35 40 45

Leu Arg Arg Ser Ala Lys Lys Ile Asp Arg His Gly Asn Ala Gly Leu
50 55 60

Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

Ile Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu Ile Asn His Leu Pro
85 90 95

Lys Ile Pro Ser Arg Val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu
100 105 110

Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

Asn Asn Tyr Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Leu Ala Arg
180 185 190

Tyr Pro Arg Asp Arg Ile Ile Pro Val Ser Pro Thr Arg Asp Thr Ala
195 200 205

Gly Ile Ile Ala Gln Cys Val Ala Asp Val Ile Ile Leu Asp Gln Val
210 215 220

Ile Ser Gly Arg Ser Ala Lys Ile Ser Pro Met Pro Leu Lys Gly Leu
225 230 235 240

Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

Val Ala Phe Ala Ala Glu Thr Thr Ile Arg Leu Leu Ala Asn Arg Gly
260 265 270

Val Thr Phe Val Glu Ala Asp Ile Pro His Leu Glu Glu Leu Asn Ser
275 280 285

Gly Ala Ser Leu Pro Ile Ala Leu Tyr Glu Phe Pro His Ala Leu Lys
290 295 300

Lys Tyr Leu Asp Asp Phe Val Gly Thr Val Ser Phe Ser Asp Val Ile
305 310 315 320

Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Val Ser Ala Gln Ile
325 330 335

Asp Gly His Gln Ile Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gln Ser
340 345 350

Phe Arg Pro Arg Leu Gln Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr
355 360 365

Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala
370 375 380

Ile Gly Gln Glu Ser Ser Val Ile His Asn Gly Ser Met Met Asn Thr
385 390 395 400

Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro Val
420 425 430

Gly Met Glu Ile Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala
435 440 445

Ile Gly Ala Ala Leu Glu Lys Ala Ile Asn Phe Pro Ser Phe Pro Asp
450 455 460

Ala Phe Asn
465

<210> 9 
<211> 1404 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 

<222> (1)..(1401) 

<223> coding for indoleacetamide hydrolase (tms2) 
<400> 9 

atg gtg ccc att acc tcg tta gca caa acc cta gaa cgc ctg aga cgg 48
Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

aaa gac tac tcc tgc tta gaa cta gta gaa act ctg ata gcg cgt tgc 96
Lys Asp Tyr Ser Cys Leu Glu Leu Val Glu Thr Leu Ile Ala Arg Cys
20 25 30

caa gct gca aaa cca tta aat gcc ctt ctg gct aca gac tgg gat ggc 144
Gln Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly
35 40 45

ttg cgg cga agc gcc aaa aaa att gat cgt cat gga aac gcc gga tta 192
Leu Arg Arg Ser Ala Lys Lys Ile Asp Arg His Gly Asn Ala Gly Leu
50 55 60

ggt ctt tgc ggc att cca ctc tgt ttt aag gcg aac atc gcg acc ggc 240
Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80



ata ttt cct aca agc gct gct act ccg gcg ctg ata aac cac ttg cca 288
Ile Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu Ile Asn His Leu Pro
85 90 95

aag ata cca tcc cgc gtc gca gaa aga ctt ttt tca gct gga gca ctg 336
Lys Ile Pro Ser Arg Val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu
100 105 110

ccg ggt gcc tcg gga aac atg cat gag tta tcg ttt gga att acg agc 384
Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

aac aac tat gcc acc ggt gcg gtg cgg aac ccg tgg aat cca agt ctg 432
Asn Asn Tyr Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

ata cca gga ggc tca agc ggt ggt gtg gct gct gcg gtg gca agc cga 480
Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

ttg atg tta ggc ggc ata ggc acc gat acc ggt gca tct gtt cgc cta 528
Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

ccc gca gcc ctg tgt ggc gta gta gga ttt cga ccg acg ctt [...] 576
Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Leu [...]
180 185 190

tat cca aga gat cgg ata ata ccg gtc agc ccc acc cgg gac [...] 624
Tyr Pro Arg Asp Arg Ile Ile Pro Val Ser Pro Thr Arg Asp [...]
195 200 205

gga atc [...] gcg cag [...] gta gcc gat gtt ata atc ctc gat [...] 672
Gly Ile Ile Ala Gln [...] Val Ala Asp Val Ile Ile Leu Asp [...]
210 215 220

att tcc gga cgg tcg [...] aaa att tca ccc atg ccg ctg aag [...] 720
Ile Ser Gly Arg Ser [...] Lys Ile Ser Pro Met Pro Leu Lys [...]
225 230 235 240

[...] gat gac ctt gat [...] 768
[...] Asp Asp Leu Asp [...]
245 250 255



gtg gcc ttc gca gct gaa acg acg att cgc ttg cta gcc aac aga ggc 816
Val Ala Phe Ala Ala Glu Thr Thr Ile Arg Leu Leu Ala Asn Arg Gly
260 265 270

gta acc ttt gtt gaa gcc gac atc ccc cac cta gag gaa ctg aat agt 864
Val Thr Phe Val Glu Ala Asp Ile Pro His Leu Glu Glu Leu Asn Ser
275 280 285

ggg gca agt ttg cca att gcg ctt tac gaa ttt cca cac gct cta aaa 912
Gly Ala Ser Leu Pro Ile Ala Leu Tyr Glu Phe Pro His Ala Leu Lys
290 295 300

aag tat ctc gac gat ttt gtg gga aca gtt tct ttt tct gac gtt atc 960
Lys Tyr Leu Asp Asp Phe Val Gly Thr Val Ser Phe Ser Asp Val Ile
305 310 315 320

aaa gga att cgt agc ccc gat gta gcg aac att gtc agt gcg caa att 1008
Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Val Ser Ala Gln Ile
325 330 335

gat ggg cat caa att tcc aac gat gaa tat gaa ctg gcg cgt caa tcc 1056
Asp Gly His Gln Ile Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gln Ser
340 345 350

ttc agg cca agg ctc cag gcc act tat cgg aat tac ttc aga ctc tat 1104
Phe Arg Pro Arg Leu Gln Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr
355 360 365

cag tta gat gca atc ctt ttc cca act gca ccc tta gcg gcc aaa gcc 1152
Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala
370 375 380

ata ggt cag gag tcg tca gtc atc cac aat ggc tca atg ata aac act 1200
Ile Gly Gln Glu Ser Ser Val Ile His Asn Gly Ser Met Ile Asn Thr
385 390 395 400

ttc aag atc tac gtg cga aat gtg gac cca agc agc aac gca ggc cta 1248
Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

cct ggg ttg agc ctt cct gcc tgc ctt aca cct gat cgc ttg cct gtt 1296
Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro Val
420 425 430

gga atg gaa att gac gga tta gcg ggg tca gac cac cgt ctg tta gca 1344
Gly Met Glu Ile Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala
435 440 445

atc ggg gca gca tta gaa aaa gcc ata aat ttt cct tcc ttt ccc gat 1392
Ile Gly Ala Ala Leu Glu Lys Ala Ile Asn Phe Pro Ser Phe Pro Asp
450 455 460

gct ttt aat tag 1404
Ala Phe Asn
465

<210> 10 
<211> 467 
<212> PRT 

<213> Agrobacterium tumefaciens 
<400> 10 

Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

Lys Asp Tyr Ser Cys Leu Glu Leu val Glu Thr Leu lie Ala Arg Cys 
20 25 30 

Gin Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly 
35 40 45 

Leu Arg Arg ser Ala Lys Lys lie Asp Arg His Gly Asn Ala Gly Leu 
50 55 60 

Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80 

lie Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu lie Asn His Leu Pro 
85 90 95 

Lys lie Pro Ser Arg val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu 
100 105 110 

Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly lie Thr Ser 
115 120 125 

Asn Asn Tyr Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu 
130 135 140

lie Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala val Ala ser Arg 
145 150 155 160 

Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

Pro Ala Ala Leu Cys Gly Val val Gly Phe Arg Pro Thr Leu Ala Arg 
180 185 190 

Tyr Pro Arg Asp Arg lie lie Pro val Ser Pro Thr Arg Asp Thr Ala 
195 200 205 

Gly lie lie Ala Gin Cys Val Ala Asp Val lie lie Leu Asp Gin val 
210 215 220 

lie Ser Gly Arg Ser Ala Lys lie Ser Pro Met Pro Leu Lys Gly Leu 
225 230 235 240 

Arg lie Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp 
245 250 255 

Val Ala Phe Ala Ala Glu Thr Thr lie Arg Leu Leu Ala Asn Arg Gly 
260 265 270 

val Thr Phe val Glu Ala Asp lie Pro His Leu Glu Glu Leu Asn Ser 
275 280 285 

Gly Ala Ser Leu Pro lie Ala Leu Tyr Glu Phe Pro His Ala Leu Lys 
290 295 300 

Lys Tyr Leu Asp Asp Phe Val Gly Thr val Ser Phe Ser Asp val lie 
305 310 315 320 

Lys Gly lie Arg Ser Pro Asp val Ala Asn lie Val Ser Ala Gin lie 
325 330 335 

Asp Gly His Gin lie Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gin Ser 
340 345 350 

Phe Arg Pro Arg Leu Gin Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr 
355 360 365 

Gin Leu Asp Ala lie Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala 
370 375 380 

lie Gly Gin Glu Ser Ser val lie His Asn Gly ser Met lie Asn Thr 
385 390 395 400 

Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro val 
420 425 430 

Gly Met Glu lie Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala 
435 440 445 

lie Gly Ala Ala Leu Glu Lys Ala lie Asn Phe Pro Ser Phe Pro Asp 
450 455 460 

Ala Phe Asn 
465 

<210> 11 
<211> 609 
<212> DNA 

<213> Xanthobacter autotrophicus

<220> 

<221> CDS 

<222> (1)..(603) 

<223> coding for haloalkane dehalogenase 
<400> 11 

atg tea acg ttt ttt gaa ccg gag aac gga atg aaa caa aac gec aaa 48 
Met Ser Thr Phe Phe Glu Pro Glu Asn Gly Met Lys Gin Asn Ala Lys 
1 5 10 15 

acc gaa cga ate ctg gat gtc gcg etc gaa ttg ctt gag aca gag ggt 96 
Thr Glu Arg lie Leu Asp Val Ala Leu Glu Leu Leu Glu Thr Glu Gly 
20 25 30 

gag ttt ggt ttg acg atg agg cag gtq gca acg caa gcg gac atg tec 144 
Glu Phe Gly Leu Thr Met Arg Gin Val Ala Thr Gin Ala Asp Met Ser 
35 40 45 

ctg age aac gtt cag tac tat ttc aag tec gag gac ctg etc etc gtq 192 
Leu ser Asn val Gin Tyr Tyr Phe Lys Ser Glu Asp Leu Leu Leu val 
50 55 60 

gee atg gca gac cgt tac ttt caa egg tgc ctg aca acc atg get gag 240 
Ala Met Ala Asp Arg Tyr Phe Gin Arg Cys Leu Thr Thr Met Ala Glu 
65 70 75 80 

cat ccg ccc tta tcg gca ggg cgt gat caa cac gcc cag tta aga gcg 288
His Pro Pro Leu Ser Ala Gly Arg Asp Gln His Ala Gln Leu Arg Ala
85 90 95

ttg tta cga gaa ctg etc ggt cat ggt ctt gag att tec gag atg tgt 336 
Leu Leu Arg Glu Leu Leu Gly His Gly Leu Glu lie Ser Glu Met Cys 
100 105 110 

cga ata ttc agg gag tac tgg gca ate gec acc cgt aat gaa act gtt 384 
Arg lie Phe Arg Glu Tyr Trp Ala lie Ala Thr Arg Asn Glu Thr Val 
115 120 125 

cac ggc tat etc aag teg tac tat egg gat etc gec gaa gtq atg get 432 
His Gly Tyr Leu Lys Ser Tyr Tyr Arg Asp Leu Ala Glu val Met Ala 
130 135 140 

gag aag ctt gcg cca ctg gcc agc agc gaa aag gcg ctg gcc gtg gcc 480
Glu Lys Leu Ala Pro Leu Ala Ser Ser Glu Lys Ala Leu Ala Val Ala
145 150 155 160 

gta tct ttg gtt att cct tat gtt gag ggg tat tcg gta acg gcc att 528
Val Ser Leu Val Ile Pro Tyr Val Glu Gly Tyr Ser Val Thr Ala Ile
165 170 175

gca atg ccc gaa tcc att gat acg att tcc gag acg ctg acc aat gtg 576
Ala Met Pro Glu Ser Ile Asp Thr Ile Ser Glu Thr Leu Thr Asn Val
180 185 190

gtg ttg gag cag ctt cgc atc agc aat tcatga 609
Val Leu Glu Gln Leu Arg Ile Ser Asn
195 200

<210> 12 
<211> 201 
<212> PRT 

<213> Xanthobacter autotrophicus 
<400> 12 

Met Ser Thr Phe Phe Glu Pro Glu Asn Gly Met Lys Gin Asn Ala Lys 
1 5 10 15 

Thr Glu Arg lie Leu Asp Val Ala Leu Glu Leu Leu Glu Thr Glu Gly 
20 25 30 

Glu Phe Gly Leu Thr Met Arg Gin Val Ala Thr Gin Ala Asp Met Ser 
35 40 45 

Leu Ser Asn val Gin Tyr Tyr Phe Lys ser Glu Asp Leu Leu Leu Val 
50 55 60 

Ala Met Ala Asp Arg Tyr Phe Gin Arg Cys Leu Thr Thr Met Ala Glu 
65 70 75 80 

His Pro Pro Leu Ser Ala Gly Arg Asp Gin His Ala Gin Leu Arg Ala 
85 90 95 

Leu Leu Arg Glu Leu Leu Gly His Gly Leu Glu lie Ser Glu Met Cys 
100 105 110 

Arg lie Phe Arg Glu Tyr Trp Ala lie Ala Thr Arg Asn Glu Thr Val 
115 120 125 

His Gly Tyr Leu Lys Ser Tyr Tyr Arg Asp Leu Ala Glu Val Met Ala 
130 135 140 

Glu Lys Leu Ala Pro Leu Ala Ser Ser Glu Lys Ala Leu Ala val Ala 
145 150 155 160 

val Ser Leu Val lie Pro Tyr Val Glu Gly Tyr Ser Val Thr Ala lie 
165 170 175 

Ala Met Pro Glu Ser lie Asp Thr lie Ser Glu Thr Leu Thr Asn val 
180 185 190 

Val Leu Glu Gin Leu Arg lie Ser Asn 
195 200 

<210> 13 
<211> 1131 
<212> DNA 

<213> Herpes simplex virus 1

<220> 

<221> CDS 

<222> (1)..(1128)

<223> coding for thymidine kinase (TK) 
<400> 13 

atg gct tcg tac ccc tgc cat caa cac gcg tct gcg ttc gac cag gct 48
Met Ala Ser Tyr Pro Cys His Gln His Ala Ser Ala Phe Asp Gln Ala
1 5 10 15

gcg cgt tct cgc ggc cat age aac cga cgt acg gcg ttg cgc cct cgc 96 
Ala Arg Ser Arg Gly His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg 
20 25 30 

egg cag caa gaa gee acg gaa gtc cgc ctg gag cag aaa atg ccc acg 144 
Arg Gin Gin Glu Ala Thr Glu val Arg Leu Glu Gin Lys Met Pro Thr 
35 40 45 

eta ctg egg gtt tat ata gac ggt cct cac ggg atg ggg aaa acc acc 192 
Leu Leu Arg val Tyr lie Asp Gly Pro His Gly Met Gly Lys Thr Thr 
50 55 60 

acc acg caa ctg ctg gtg gec ctg ggt teg cgc gac gat ate gtc tac 240 
Thr Thr Gin Leu Leu val Ala Leu Gly Ser Arg Asp Asp lie val Tyr 
65 70 75 80 

gta ccc gag ccg atg act tac tgg cag gtg ctg ggg get tec gag aca 288 
val Pro Glu Pro Met Thr Tyr Trp Gin Val Leu Gly Ala Ser Glu Thr 
85 90 95 



atc gcg aac atc tac acc aca caa cac cgc ctc gac cag ggt gag ata 336
Ile Ala Asn Ile Tyr Thr Thr Gln His Arg Leu Asp Gln Gly Glu Ile
100 105 110

tcg gcc ggg gac gcg gcg gtg gta atg aca agc gcc cag ata aca atg 384
Ser Ala Gly Asp Ala Ala Val Val Met Thr Ser Ala Gln Ile Thr Met
115 120 125

ggc atg cct tat gcc gtg acc gac gcc gtt ctg gct cct cat gtc ggg 432
Gly Met Pro Tyr Ala Val Thr Asp Ala Val Leu Ala Pro His Val Gly
130 135 140

ggg gag gct ggg agt tca cat gcc ccg ccc ccg gcc ctc acc ctc atc 480
Gly Glu Ala Gly Ser Ser His Ala Pro Pro Pro Ala Leu Thr Leu Ile
145 150 155 160

ttc gac cgc cat ccc ate gee gee etc ctg tgc tac ccg gec gcg cga 528 
Phe Asp Arg His Pro lie Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg 
165 170 175 

tac ctt atg ggc age atg acc ccc cag gec gtg ctg gcg ttc gtg gec 576 
Tyr Leu Met Gly Ser Met Thr Pro Gin Ala val Leu Ala Phe Val Ala 
180 185 190 

etc ate ccg ccg acc ttg ccc ggc aca aac ate gtg ttg ggg gee ctt 624 
Leu lie Pro Pro Thr Leu Pro Gly Thr Asn lie val Leu Gly Ala Leu 
195 200 205 

ccg gag gac aga cac ate gac cgc ctg gec aaa cgc cag cgc ccc ggc 672 
Pro Glu Asp Arg His lie Asp Arg Leu Ala Lys Arg Gin Arg Pro Gly 
210 215 220 

gag cgg ctt gac ctg gct atg ctg gcc gcg att cgc cgc gtt tac ggg 720
Glu Arg Leu Asp Leu Ala Met Leu Ala Ala Ile Arg Arg Val Tyr Gly
225 230 235 240

ctg ctt gec aat acg gtg egg tat ctg cag ggc ggc ggg teg tgg tgg 768 
Leu Leu Ala Asn Thr val Arg Tyr Leu Gin Gly Gly Gly Ser Trp Trp 
245 250 255 

gag gat tgg gga cag ctt teg ggg acg gee gtg ccg ccc cag ggt gec 816 
Glu Asp Trp Gly Gin Leu Ser Gly Thr Ala Val Pro Pro Gin Gly Ala 
260 265 270 

gag ccc cag age aac gcg ggc cca cga ccc cat ate ggg gac acg tta 864 
Glu Pro Gin Ser Asn Ala Gly Pro Arg Pro His lie Gly Asp Thr Leu 
275 280 285 

ttt acc ctg ttt egg gec ccc gag ttg ctg gee ccc aac ggc gac ctg 912 
Phe Thr Leu Phe Arg Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu 
290 295 300 



tat aac gtg ttt gcc tgg gcc ttg gac gtc ttg gcc aaa cgc ctc cgt 960
Tyr Asn Val Phe Ala Trp Ala Leu Asp Val Leu Ala Lys Arg Leu Arg
305 310 315 320

ccc atg cac gtc ttt atc ctg gat tac gac caa tcg ccc gcc ggc tgc 1008
Pro Met His Val Phe Ile Leu Asp Tyr Asp Gln Ser Pro Ala Gly Cys
325 330 335 

egg gac gee ctg ctg caa ctt acc tec ggg atg gtc cag acc cac gtc 1056 
Arg Asp Ala Leu Leu Gin Leu Thr Ser Gly Met Val Gin Thr His val 
340 345 350 

acc acc cca ggc tec ata ccg acg ate tgc gac ctg gcg cgc acg ttt 1104 
Thr Thr Pro Gly Ser lie Pro Thr lie Cys Asp Leu Ala Arg Thr Phe 
355 360 365 

gee egg gag atg ggg gag get aac tga 1131 
Ala Arg Glu Met Gly Glu Ala Asn 
370 375 

<210> 14 
<211> 376 
<212> PRT 

<213> Herpes simplex virus 1 
<400> 14 

Met Ala Ser Tyr Pro Cys His Gln His Ala Ser Ala Phe Asp Gln Ala
1 5 10 15

Ala Arg Ser Arg Gly His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg
20 25 30

Arg Gin Gin Glu Ala Thr Glu val Arg Leu Glu Gin Lys Met Pro Thr 
35 40 45 

Leu Leu Arg val Tyr lie Asp Gly Pro His Gly Met Gly Lys Thr Thr 
50 55 60 

Thr Thr Gln Leu Leu Val Ala Leu Gly Ser Arg Asp Asp Ile Val Tyr
65 70 75 80 

val Pro Glu Pro Met Thr Tyr Trp Gin val Leu Gly Ala ser Glu Thr 
85 90 95 

Ile Ala Asn Ile Tyr Thr Thr Gln His Arg Leu Asp Gln Gly Glu Ile
100 105 110

Ser Ala Gly Asp Ala Ala val Val Met Thr Ser Ala Gin lie Thr Met 
115 120 125 

Gly Met Pro Tyr Ala Val Thr Asp Ala val Leu Ala Pro His Val Gly 
130 135 140

Gly Glu Ala Gly Ser Ser His Ala Pro Pro Pro Ala Leu Thr Leu lie 
145 150 155 160 

Phe Asp Arg His Pro lie Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg 
165 170 175 

Tyr Leu Met Gly Ser Met Thr Pro Gin Ala Val Leu Ala Phe Val Ala 
180 185 190 

Leu lie Pro Pro Thr Leu Pro Gly Thr Asn lie Val Leu Gly Ala Leu 
195 200 205 

Pro Glu Asp Arg His lie Asp Arg Leu Ala Lys Arg Gin Arg Pro Gly 
210 215 220 

Glu Arg Leu Asp Leu Ala Met Leu Ala Ala lie Arg Arg val Tyr Gly 
225 230 235 240 

Leu Leu Ala Asn Thr Val Arg Tyr Leu Gin Gly Gly Gly Ser Trp Trp 
245 250 255 

Glu Asp Trp Gly Gin Leu Ser Gly Thr Ala val Pro Pro Gin Gly Ala 
260 265 270 

Glu Pro Gin Ser Asn Ala Gly Pro Arg Pro His lie Gly Asp Thr Leu 
275 280 285

Phe Thr Leu Phe Arg Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu 
290 295 300 

Tyr Asn val Phe Ala Trp Ala Leu Asp val Leu Ala Lys Arg Leu Arg 
305 310 315 320 

Pro Met His Val Phe Ile Leu Asp Tyr Asp Gln Ser Pro Ala Gly Cys
325 330 335 

Arg Asp Ala Leu Leu Gin Leu Thr Ser Gly Met val Gin Thr His val 
340 345 350 

Thr Thr Pro Gly ser lie Pro Thr lie Cys Asp Leu Ala Arg Thr Phe 
355 360 365 

Ala Arg Glu Met Gly Glu Ala Asn 
370 375 



<210> 15 
<211> 1131 
<212> DNA 

<213> Herpes simplex virus 1 

<220> 

<221> CDS 

<222> (1)..(1128) 
<223> coding for thymidine kinase (TK) 

<400> 15 

atg gct tcg tac ccc tgc cat caa cac gcg tct gcg ttc gac cag gct 48
Met Ala Ser Tyr Pro Cys His Gln His Ala Ser Ala Phe Asp Gln Ala
1 5 10 15

gcg cgt tct cgc ggc cat age aac cga cgt acg gcg ttg cgc cct cgc 96 
Ala Arg Ser Arg Gly His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg 
20 25 30 

egg cag caa gaa gee acg gaa gtc cgc ctg gag cag aaa atg ccc acg 144 
Arg Gin Gin Glu Ala Thr Glu val Arg Leu Glu Gin Lys Met Pro Thr 
35 40 45 

eta ctg egg gtt tat ata gac ggt cct cac ggg atg ggg aaa ace ace 192 
Leu Leu Arg Val Tyr lie Asp Gly Pro His Gly Met Gly Lys Thr Thr 
50 55 60 

ace acg caa ctg ctg gtg gee ctg ggt teg cgc gac gat ate gtc tac 240 
Thr Thr Gin Leu Leu val Ala Leu Gly Ser Arg Asp Asp lie val Tyr 
65 70 75 80 

gta ccc gag ccg atg act tac tgg cag gtg ctg gqg get tec gag aca 288 
val Pro Glu Pro Met Thr Tyr Trp Gin val Leu Gly Ala Ser Glu Thr 
85 90 95 

ate gcg aac ate tac acc aca caa cac cgc etc gac cag ggt gag ata 336 
lie Ala Asn lie Tyr Thr Thr Gin His Arg Leu Asp Gin Gly Glu lie 
100 105 110 

teg gee gqg gac gcg gcg gtg gta atg aca age gec cag ata aca atg 384 
Ser Ala Gly Asp Ala Ala val val Met Thr ser Ala Gin lie Thr Met 
115 120 125 

ggc atg cct tat gec gtg acc gac gec gtt ctg get cct cat gtc gqg 432 
Gly Met Pro Tyr Ala val Thr Asp Ala val Leu Ala Pro His val Gly 
130 135 140 

gqg gag get gqg agt tea cat gec ccg ccc ccg gee etc acc etc ate 480 
Gly Glu Ala Gly Ser Ser His Ala Pro Pro Pro Ala Leu Thr Leu lie 
145 150 155 160 

ttc gac cgc cat ccc ate gec gec etc ctg tgc tac ccg gee gcg cga 528 
Phe Asp Arg His Pro lie Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg 
165 170 175 

tac ctt atg ggc age atg acc ccc cag gec gtg ctg gcg ttc gtg gec 576 
Tyr Leu Met Gly Ser Met Thr Pro Gin Ala val Leu Ala Phe Val Ala 
180 185 190 

etc ate ccg ccg acc ttg ccc ggc aca aac ate gtg ttg ggg gee ctt 624 
Leu lie Pro Pro Thr Leu Pro Gly Thr Asn lie val Leu Gly Ala Leu 
195 200 205 

ccg gag gac aga cac ate gac cgc ctg gee aaa cgc cag cgc ccc ggc 672 
Pro Glu Asp Arg His lie Asp Arg Leu Ala Lys Arg Gin Arg Pro Gly 
210 215 220 

gag egg ctt gac ctg get atg ctg gec gcg att cgc cgc gtt tac ggg 720 
Glu Arg Leu Asp Leu Ala Met Leu Ala Ala lie Arg Arg val Tyr Gly 
225 230 235 240 

ctg ctt gcc aat acg gtg cgg tat ctg cag ggc ggc ggg tcg tgg tgg 768
Leu Leu Ala Asn Thr Val Arg Tyr Leu Gln Gly Gly Gly Ser Trp Trp
245 250 255

gag gat tgg gga cag ctt tcg ggg acg gcc gtg ccg ccc cag ggt gcc 816
Glu Asp Trp Gly Gln Leu Ser Gly Thr Ala Val Pro Pro Gln Gly Ala
260 265 270

gag ccc cag agc aac gcg ggc cca cga ccc cat atc ggg gac acg tta 864
Glu Pro Gln Ser Asn Ala Gly Pro Arg Pro His Ile Gly Asp Thr Leu
275 280 285

ttt acc ctg ttt cgg gcc ccc gag ttg ctg gcc ccc aac ggc gac ctg 912
Phe Thr Leu Phe Arg Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu
290 295 300

tat aac gtg ttt gcc tgg gcc ttg gac gtc ttg gcc aaa cgc ctc cgt 960
Tyr Asn Val Phe Ala Trp Ala Leu Asp Val Leu Ala Lys Arg Leu Arg
305 310 315 320

ccc atg cac gtc ttt atc ctg gat tac gac caa tcg ccc gcc ggc tgc 1008
Pro Met His Val Phe Ile Leu Asp Tyr Asp Gln Ser Pro Ala Gly Cys
325 330 335

cgg gac gcc ctg ctg caa ctt acc tcc ggg atg gtc cag acc cac gtc 1056
Arg Asp Ala Leu Leu Gln Leu Thr Ser Gly Met Val Gln Thr His Val
340 345 350

acc acc cca ggc tcc ata ccg acg atc tgc gac ctg gcg cgc acg ttt 1104
Thr Thr Pro Gly Ser Ile Pro Thr Ile Cys Asp Leu Ala Arg Thr Phe
355 360 365

gcc cgg gag atg ggg gag gct aac tga 1131
Ala Arg Glu Met Gly Glu Ala Asn
370 375

<210> 16
<211> 376
<212> PRT

<213> Herpes simplex virus 1
<400> 16

Met Ala Ser Tyr Pro Cys His Gln His Ala Ser Ala Phe Asp Gln Ala
1 5 10 15

Ala Arg Ser Arg Gly His Ser Asn Arg Arg Thr Ala Leu Arg Pro Arg
20 25 30

Arg Gln Gln Glu Ala Thr Glu Val Arg Leu Glu Gln Lys Met Pro Thr
35 40 45

Leu Leu Arg Val Tyr Ile Asp Gly Pro His Gly Met Gly Lys Thr Thr
50 55 60

Thr Thr Gln Leu Leu Val Ala Leu Gly Ser Arg Asp Asp Ile Val Tyr
65 70 75 80

Val Pro Glu Pro Met Thr Tyr Trp Gln Val Leu Gly Ala Ser Glu Thr
85 90 95

Ile Ala Asn Ile Tyr Thr Thr Gln His Arg Leu Asp Gln Gly Glu Ile
100 105 110

Ser Ala Gly Asp Ala Ala Val Val Met Thr Ser Ala Gln Ile Thr Met
115 120 125

Gly Met Pro Tyr Ala val Thr Asp Ala Val Leu Ala Pro His Val Gly 
130 135 140 

Gly Glu Ala Gly Ser Ser His Ala Pro Pro Pro Ala Leu Thr Leu lie 
145 150 155 160 

Phe Asp Arg His Pro lie Ala Ala Leu Leu Cys Tyr Pro Ala Ala Arg 
165 170 175 

Tyr Leu Met Gly Ser Met Thr Pro Gln Ala Val Leu Ala Phe Val Ala
180 185 190

Leu lie Pro Pro Thr Leu Pro Gly Thr Asn lie Val Leu Gly Ala Leu 
195 200 205 

Pro Glu Asp Arg His Ile Asp Arg Leu Ala Lys Arg Gln Arg Pro Gly
210 215 220

Glu Arg Leu Asp Leu Ala Met Leu Ala Ala Ile Arg Arg Val Tyr Gly
225 230 235 240

Leu Leu Ala Asn Thr val Arg Tyr Leu Gin Gly Gly Gly Ser Trp Trp 
245 250 255 

Glu Asp Trp Gly Gin Leu ser Gly Thr Ala val Pro Pro Gin Gly Ala 
260 265 270 

Glu Pro Gin ser Asn Ala Gly Pro Arg Pro His lie Gly Asp Thr Leu 
275 280 285 

Phe Thr Leu Phe Arg Ala Pro Glu Leu Leu Ala Pro Asn Gly Asp Leu 
290 295 300

Tyr Asn Val Phe Ala Trp Ala Leu Asp Val Leu Ala Lys Arg Leu Arg 
305 310 315 320 

Pro Met His Val Phe lie Leu Asp Tyr Asp Gin Ser Pro Ala Gly Cys 
325 330 335 

Arg Asp Ala Leu Leu Gin Leu Thr Ser Gly Met val Gin Thr His val 
340 345 350 

Thr Thr Pro Gly Ser lie Pro Thr lie Cys Asp Leu Ala Arg Thr Phe 
355 360 365 

Ala Arg Glu Met Gly Glu Ala Asn 
370 375 

<210> 17 
<211> 840 
<212> DNA 

<213> Toxoplasma gondii 

<220> 

<221> CDS 

<222> (1)..(837) 

<223> coding for hypoxanthine-xanthine-guanine 
phosphoribosyl transferase (HXGPRTase)

<400> 17 

atg gcg tcc aaa ccc att gaa gaa tcc cgg tcg caa aaa cgg agt gcc 48
Met Ala Ser Lys Pro Ile Glu Glu Ser Arg Ser Gln Lys Arg Ser Ala
1 5 10 15

ttc tca gac atc ttc tgt tgt tgc act cct aat gaa ggg gct atc gtg 96
Phe Ser Asp Ile Phe Cys Cys Cys Thr Pro Asn Glu Gly Ala Ile Val
20 25 30

ccc agt gac cca atg gtc tcc acc agt gct cca gca cgc acc agt gct 144
Pro Ser Asp Pro Met Val Ser Thr Ser Ala Pro Ala Arg Thr Ser Ala
35 40 45

cca gcg cgc tcc agt gca ctt caa gac tac ggc aag ggc aag ggc cgt 192
Pro Ala Arg Ser Ser Ala Leu Gln Asp Tyr Gly Lys Gly Lys Gly Arg
50 55 60

att gag ccc atg tat atc ccc gac aac acc ttc tac aac gct gat gac 240
Ile Glu Pro Met Tyr Ile Pro Asp Asn Thr Phe Tyr Asn Ala Asp Asp
65 70 75 80

ttt ctt gtg ccc ccc cac tgc aag ccc tac att gac aaa atc ctc ctc 288
Phe Leu Val Pro Pro His Cys Lys Pro Tyr Ile Asp Lys Ile Leu Leu
85 90 95

cct ggt gga ttg gtc aag gac aga gtt gag aag ttg gcg tat gac atc 336
Pro Gly Gly Leu Val Lys Asp Arg Val Glu Lys Leu Ala Tyr Asp Ile
100 105 110

cac aga act tac ttc ggc gag gag ttg cac atc att tgc atc ctg aaa 384
His Arg Thr Tyr Phe Gly Glu Glu Leu His Ile Ile Cys Ile Leu Lys
115 120 125

ggc tct cgc gqc ttc ttc aac ctt ctg ate gac tac ctt gee acc ata 432 

Gly Ser Arg Gly Phe Phe Asn Leu Leu lie Asp Tyr Leu Ala Thr lie 
130 135 140 

cag aag tac agt ggt cgt gag tec age gtg ccc ccc ttc ttc gag cac 480 

Gin Lys Tyr Ser Gly Arg Glu Ser ser Val Pro Pro Phe Phe Glu His 

145 150 155 160 

tat gtc cgc ctg aag tec tac cag aac gac aac age aca ggc cag etc 528 

Tyr val Arg Leu Lys Ser Tyr Gin Asn Asp Asn ser Thr Gly Gin Leu 

165 170 175 

acc gtc ttg age gac gac ttg tea ate ttt cgc gac aag cac gtt ctg 576 

Thr val Leu Ser Asp Asp Leu Ser lie Phe Arg Asp Lys His Val Leu 

180 185 190 

att gtt gag gac ate gtc gac acc ggt ttc acc etc acc gag ttc ggt 624 

lie val Glu Asp lie Val Asp Thr Gly Phe Thr Leu Thr Glu Phe Gly 

195 200 205 

gag cgc ctg aaa gec gtc ggt ccc aag teg atg aga ate gec acc etc 672 

Glu Arg Leu Lys Ala val Gly Pro Lys Ser Met Arg lie Ala Thr Leu 
210 215 220 

gtc gag aag cgc aca gat cgc tec aac age ttg aag ggc gac ttc gtc 720 

val Glu Lys Arg Thr Asp Arg Ser Asn Ser Leu Lys Gly Asp Phe val 

225 230 235 240 

ggc ttc agc att gaa gac gtc tgg atc gtt ggt tgc tgc tac gac ttc 768
Gly Phe Ser Ile Glu Asp Val Trp Ile Val Gly Cys Cys Tyr Asp Phe
245 250 255

aac gag atg ttc cgc gac ttc gac cac gtc gcc gtc ctg agc gac gcc 816
Asn Glu Met Phe Arg Asp Phe Asp His Val Ala Val Leu Ser Asp Ala
260 265 270 

get cgc aaa aag ttc gag aag taa 840 
Ala Arg Lys Lys Phe Glu Lys 
275 

<210> 18 
<211> 279 
<212> PRT 

<213> Toxoplasma gondii 
<400> 18 

Met Ala ser Lys Pro lie Glu Glu Ser Arg ser Gin Lys Arg Ser Ala 
1 5 10 15 

Phe Ser Asp lie Phe Cys Cys Cys Thr Pro Asn Glu Gly Ala lie val 
20 25 30 

Pro Ser Asp Pro Met Val Ser Thr ser Ala Pro Ala Arg Thr Ser Ala 
35 40 45 

Pro Ala Arg ser Ser Ala Leu Gin Asp Tyr Gly Lys Gly Lys Gly Arg 
50 55 60 

lie Glu Pro Met Tyr lie Pro Asp Asn Thr Phe Tyr Asn Ala Asp Asp 
65 70 75 80 

Phe Leu val Pro Pro His Cys Lys Pro Tyr lie Asp Lys lie Leu Leu 
85 90 95 

pro Gly Gly Leu Val Lys Asp Arg val Glu Lys Leu Ala Tyr Asp lie 
100 105 110 

His Arg Thr Tyr Phe Gly Glu Glu Leu His lie lie Cys lie Leu Lys 
115 120 125 

Gly Ser Arg Gly Phe Phe Asn Leu Leu lie Asp Tyr Leu Ala Thr lie 
130 135 140 

Gin Lys Tyr ser Gly Arg Glu Ser Ser Val Pro Pro Phe Phe Glu His 
145 150 155 160 

Tyr Val Arg Leu Lys Ser Tyr Gin Asn Asp Asn Ser Thr Gly Gin Leu 
165 170 175 

Thr val Leu Ser Asp Asp Leu Ser lie Phe Arg Asp Lys His val Leu 
180 185 190 

lie val Glu Asp lie val Asp Thr Gly Phe Thr Leu Thr Glu Phe Gly 
195 200 205 

Glu Arg Leu Lys Ala val Gly Pro Lys Ser Met Arg lie Ala Thr Leu 
210 215 220 

val Glu Lys Arg Thr Asp Arg ser Asn Ser Leu Lys Gly Asp Phe val 
225 230 235 240 

Gly Phe Ser Ile Glu Asp Val Trp Ile Val Gly Cys Cys Tyr Asp Phe
245 250 255

Asn Glu Met Phe Arg Asp Phe Asp His Val Ala Val Leu Ser Asp Ala
260 265 270 

Ala Arg Lys Lys Phe Glu Lys 
275 

<210> 19 
<211> 459 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)..(456) 

<223> coding for xanthine-guanine phosphoribosyl 
transferase (gpt) 

<400> 19 

atg agc gaa aaa tac atc gtc acc tgg gac atg ttg cag atc cat gca 48
Met Ser Glu Lys Tyr Ile Val Thr Trp Asp Met Leu Gln Ile His Ala
1 5 10 15

cgt aaa etc gca age cga ctg atg cct tct gaa caa tgg aaa ggc att 96 
Arg Lys Leu Ala Ser Arg Leu Met Pro Ser Glu Gin Trp Lys Gly lie 
20 25 30 

att gee gta age cgt ggc gqt ctg gta ccg gqt gcg tta ctg gcg cgt 144 
lie Ala val Ser Arg Gly Gly Leu val Pro Gly Ala Leu Leu Ala Arg 
35 40 45 

gaa ctg gqt att cgt cat gtc gat acc gtt tgt att tec age tac gat 192 
Glu Leu Gly lie Arg His val Asp Thr Val Cys lie Ser Ser Tyr Asp 
50 55 60 

cac gac aac cag cgc gag ctt aaa gtq ctg aaa cgc gca gaa ggc gat 240 
His Asp Asn Gin Arg Glu Leu Lys val Leu Lys Arg Ala Glu Gly Asp 
65 70 75 80 



ggc gaa ggc ttc atc gtt att gat gac ctg gtg gat acc ggt ggt act 288
Gly Glu Gly Phe Ile Val Ile Asp Asp Leu Val Asp Thr Gly Gly Thr
85 90 95

gcg gtt gcg att cgt gaa atg tat cca aaa gcg cac ttt gtc acc ate 336 
Ala val Ala lie Arg Glu Met Tyr Pro Lys Ala His Phe Val Thr lie 
100 105 110 

ttc gca aaa ccg get gqt cgt ccg ctg gtt gat gac tat gtt gtt gat 384 
Phe Ala Lys Pro Ala Gly Arg Pro Leu val Asp Asp Tyr val val Asp 
115 120 125 

ate ccg caa gat acc tgg att gaa cag ccg tgg gat atg ggc gtc gta 432 
lie Pro Gin Asp Thr Trp lie Glu Gin Pro Trp Asp Met Gly val Val 
130 135 140 

ttc gtc ccg cca ate tec gqt cgc taa 459 
Phe val Pro Pro lie Ser Gly Arg 
145 150 

<210> 20 
<211> 152 
<212> PRT 

<213> Escherichia coli 
<400> 20

Met Ser Glu Lys Tyr Ile Val Thr Trp Asp Met Leu Gln Ile His Ala
1 5 10 15

Arg Lys Leu Ala Ser Arg Leu Met Pro Ser Glu Gin Trp Lys Gly lie 
20 25 30 

lie Ala val Ser Arg Gly Gly Leu Val Pro Gly Ala Leu Leu Ala Arg 
35 40 45

Glu Leu Gly lie Arg His Val Asp Thr Val Cys lie Ser ser Tyr Asp 
50 55 60 

His Asp Asn Gin Arg Glu Leu Lys val Leu Lys Arg Ala Glu Gly Asp 
65 70 75 80 

Gly Glu Gly Phe Ile Val Ile Asp Asp Leu Val Asp Thr Gly Gly Thr
85 90 95 

Ala val Ala lie Arg Glu Met Tyr Pro Lys Ala His Phe Val Thr lie 
100 105 110 

Phe Ala Lys Pro Ala Gly Arg Pro Leu val Asp Asp Tyr Val val Asp 
115 120 125 

lie Pro Gin Asp Thr Trp lie Glu Gin Pro Trp Asp Met Gly val val 
130 135 140 

Phe val Pro Pro lie Ser Gly Arg 
145 150 

<210> 21 
<211> 459 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)..(456) 

<223> coding for xanthine-guanine phosphoribosyl 
transferase (gpt) 

<400> 21 

atg agc gaa aaa tac atc gtc acc tgg gac atg ttg cag atc cat gca 48
Met Ser Glu Lys Tyr Ile Val Thr Trp Asp Met Leu Gln Ile His Ala
1 5 10 15

cgt aaa ctc gca agc cga ctg atg cct tct gaa caa tgg aaa ggc att 96
Arg Lys Leu Ala Ser Arg Leu Met Pro Ser Glu Gln Trp Lys Gly Ile
20 25 30

att gcc gta agc cgt ggc ggt ctg gta ccg ggt gcg tta ctg gcg cgt 144
Ile Ala Val Ser Arg Gly Gly Leu Val Pro Gly Ala Leu Leu Ala Arg
35 40 45

gaa ctg ggt att cgt cat gtc gat acc gtt tgt att tcc agc tac gat 192
Glu Leu Gly Ile Arg His Val Asp Thr Val Cys Ile Ser Ser Tyr Asp
50 55 60

cac gac aac cag cgc gag ctt aaa gtg ctg aaa cgc gca gaa ggc gat 240
His Asp Asn Gln Arg Glu Leu Lys Val Leu Lys Arg Ala Glu Gly Asp
65 70 75 80

ggc gaa ggc ttc atc gtt att gat gac ctg gtg gat acc ggt ggt act 288
Gly Glu Gly Phe Ile Val Ile Asp Asp Leu Val Asp Thr Gly Gly Thr
85 90 95

gcg gtt gcg att cgt gaa atg tat cca aaa gcg cac ttt gtc acc ate 336 
Ala val Ala lie Arg Glu Met Tyr Pro Lys Ala His Phe Val Thr lie 
100 105 110 

ttc gca aaa ccg get ggt cgt ccg ctg gtt gat gac tat gtt gtt gat 384 
Phe Ala Lys Pro Ala Gly Arg Pro Leu Val Asp Asp Tyr Val Val Asp 
115 120 125 

ate ccg caa gat acc tgg att gaa cag ccg tgg gat atg ggc gtc gta 432 
lie Pro Gin Asp Thr Trp lie Glu Gin Pro Trp Asp Met Gly Val Val 
130 135 140 

ttc gtc ccg cca ate tec ggt cgc taa 459 
Phe val Pro Pro lie Ser Gly Arg 
145 150 

<210> 22 
<211> 152 
<212> PRT 

<213> Escherichia coli 
<400> 22 

Met Ser Glu Lys Tyr Ile Val Thr Trp Asp Met Leu Gln Ile His Ala
1 5 10 15

Arg Lys Leu Ala Ser Arg Leu Met Pro Ser Glu Gin Trp Lys Gly lie 
20 25 30 

lie Ala val Ser Arg Gly Gly Leu val Pro Gly Ala Leu Leu Ala Arg 
35 40 45

Glu Leu Gly lie Arg His val Asp Thr val Cys lie Ser Ser Tyr Asp 
50 55 60 

His Asp Asn Gin Arg Glu Leu Lys val Leu Lys Arg Ala Glu Gly Asp 
65 70 75 80 

Gly Glu Gly Phe lie val lie Asp Asp Leu Val Asp Thr Gly Gly Thr 
85 90 95 

Ala val Ala lie Arg Glu Met Tyr Pro Lys Ala His Phe Val Thr lie 
100 105 110

Phe Ala Lys Pro Ala Gly Arg Pro Leu val Asp Asp Tyr val val Asp 
115 120 125 

lie Pro Gin Asp Thr Trp lie Glu Gin Pro Trp Asp Met Gly Val Val 
130 135 140 

Phe val Pro Pro lie Ser Gly Arg 
145 150 

<210> 23 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220>

<221> CDS 

<222> (1)..(717) 

<223> coding for purine nucleoside phosphorylase (deoD) 
<400> 23 

atg gct acc cca cac att aat gca gaa atg ggc gat ttc gct gac gta 48
Met Ala Thr Pro His Ile Asn Ala Glu Met Gly Asp Phe Ala Asp Val
1 5 10 15

gtt ttg atg cca gqc gac ccg ctg cgt gcg aag tat att get gaa act 96 

val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr lie Ala Glu Thr 

20 25 30 

ttc ctt gaa gat gec cgt gaa gtg aac aac gtt cgc ggt atg ctg gqc 144 

Phe Leu Glu Asp Ala Arg Glu val Asn Asn val Arg Gly Met Leu Gly 

35 40 45 

ttc acc ggt act tac aaa gqc cgc aaa att tec gta atg ggt cac ggt 192 

Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser val Met Gly His Gly 
50 55 60 

atg ggt ate ccg tec tgc tec ate tac acc aaa gaa ctg ate acc gat 240 

Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 

65 70 75 80 

ttc gqc gtg aag aaa att ate cgc gtg ggt tec tgt ggc gca gtt ctg 288 

Phe Gly val Lys Lys lie lie Arg val Gly Ser Cys Gly Ala val Leu 

85 90 95 

ccg cac gta aaa ctg cgc gac gtc gtt ate ggt atg ggt gee tgc acc 336 

pro His val Lys Leu Arg Asp val val lie Gly Met Gly Ala Cys Thr 

100 105 110 

gat tec aaa gtt aac cgc ate cgt ttt aaa gac cat gac ttt gee get 384 

Asp Ser Lys Val Asn Arg lie Arg Phe Lys Asp His Asp Phe Ala Ala 

115 120 125 

ate get gac ttc gac atg gtg cgt aac gca gta gat gca get aaa gca 432 

lie Ala Asp Phe Asp Met val Arg Asn Ala val Asp Ala Ala Lys Ala 
130 135 140

ctg ggt att gat get cgc gtg ggt aac ctg ttc tec get gac ctg ttc 480 

Leu Gly lie Asp Ala Arg val Gly Asn Leu Phe Ser Ala Asp Leu Phe 

145 150 155 160 

tac tct ccg gac ggc gaa atg ttc gac gtg atg gaa aaa tac ggc att 528 

Tyr Ser Pro Asp Gly Glu Met Phe Asp Val Met Glu Lys Tyr Gly lie 

165 170 175 

etc ggc gtg gaa atg gaa gcg get ggt ate tac ggc gtc get gca gaa 576 

Leu Gly val Glu Met Glu Ala Ala Gly lie Tyr Gly val Ala Ala Glu 

180 185 190 

ttt ggc gcg aaa gee ctg acc ate tgc acc gta tct gac cac ate cgc 624 

Phe Gly Ala Lys Ala Leu Thr lie Cys Thr Val Ser Asp His lie Arg 

195 200 205 

act cac gag cag acc act gcc gct gag cgt cag act acc ttc aac gac 672
Thr His Glu Gln Thr Thr Ala Ala Glu Arg Gln Thr Thr Phe Asn Asp
210 215 220

atg atc aaa atc gca ctg gaa tcc gtt ctg ctg ggc gat aaa gag taa 720
Met Ile Lys Ile Ala Leu Glu Ser Val Leu Leu Gly Asp Lys Glu
225 230 235



<210> 24 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 24 

Met Ala Thr Pro His lie Asn Ala Glu Met Gly Asp Phe Ala Asp val 
1 5 10 15 

Val Leu Met Pro Gly Asp Pro Leu Arg Ala Lys Tyr lie Ala Glu Thr 
20 25 30 

Phe Leu Glu Asp Ala Arg Glu Val Asn Asn Val Arg Gly Met Leu Gly 
35 40 45 

Phe Thr Gly Thr Tyr Lys Gly Arg Lys lie Ser Val Met Gly His Gly 
50 55 60 

Met Gly lie Pro Ser Cys Ser lie Tyr Thr Lys Glu Leu lie Thr Asp 
65 70 75 80 

Phe Gly Val Lys Lys lie lie Arg val Gly Ser Cys Gly Ala val Leu 
85 90 95 

Pro His val Lys Leu Arg Asp val val lie Gly Met Gly Ala Cys Thr 
100 105 110

Asp Ser Lys val Asn Arg lie Arg Phe Lys Asp His Asp Phe Ala Ala 
115 120 125 

lie Ala Asp Phe Asp Met val Arg Asn Ala val Asp Ala Ala Lys Ala 
130 135 140 

Leu Gly lie Asp Ala Arg Val Gly Asn Leu Phe Ser Ala Asp Leu Phe 
145 150 155 160 

Tyr Ser Pro Asp Gly Glu Met Phe Asp val Met Glu Lys Tyr Gly lie 
165 170 175 

Leu Gly val Glu Met Glu Ala Ala Gly lie Tyr Gly val Ala Ala Glu 
180 185 190 

Phe Gly Ala Lys Ala Leu Thr lie Cys Thr Val Ser Asp His lie Arg 
195 200 205 

Thr His Glu Gin Thr Thr Ala Ala Glu Arg Gin Thr Thr Phe Asn Asp 
210 215 220 

Met lie Lys lie Ala Leu Glu Ser val Leu Leu Gly Asp Lys Glu 
225 230 235 

<210> 25 
<211> 1545 
<212> DNA 

<213> Burkholderia caryophylli 

<220> 

<221> CDS 

<222> (1)..(1542) 
<223> coding for phosphonate monoester hydrolase (pehA) 

<400> 25 

atg acc aga aaa aat gtc ctg ctt atc gtc gtt gat caa tgg cga gca 48
Met Thr Arg Lys Asn Val Leu Leu Ile Val Val Asp Gln Trp Arg Ala
1 5 10 15

gat ttt atc cct cac ctg atg cgg gcg gag ggg cgc gaa cct ttc ctt 96
Asp Phe Ile Pro His Leu Met Arg Ala Glu Gly Arg Glu Pro Phe Leu
20 25 30

aaa act ccc aat ctt gat cgt ctt tgc cgg gaa ggc ttg acc ttc cgc 144
Lys Thr Pro Asn Leu Asp Arg Leu Cys Arg Glu Gly Leu Thr Phe Arg
35 40 45

aat cat gtc acg acg tgc gtg ccg tgt ggt ccg gca agg gca agc ctg 192
Asn His Val Thr Thr Cys Val Pro Cys Gly Pro Ala Arg Ala Ser Leu
50 55 60



ctg acg ggc ctc tac ctg atg aac cac cgg gcg gtg cag aac act gtt 240
Leu Thr Gly Leu Tyr Leu Met Asn His Arg Ala Val Gln Asn Thr Val
65 70 75 80

ccg ctt gac cag cgc cat cta aac ctt ggc aag gcc ctg cgc gcc att 288
Pro Leu Asp Gln Arg His Leu Asn Leu Gly Lys Ala Leu Arg Ala Ile
85 90 95

ggc tac gat ccc gcg ctc att ggt tac acc acc acg aca cct gat ccg 336
Gly Tyr Asp Pro Ala Leu Ile Gly Tyr Thr Thr Thr Thr Pro Asp Pro
100 105 110

cgc aca acc tct gca agg gat ccg cgt ttc acg gtc ctg ggc gac atc 384
Arg Thr Thr Ser Ala Arg Asp Pro Arg Phe Thr Val Leu Gly Asp Ile
115 120 125

atg gac ggc ttt cgt tcg gtc ggc gca ttc gag ccc aat atg gag ggg 432
Met Asp Gly Phe Arg Ser Val Gly Ala Phe Glu Pro Asn Met Glu Gly
130 135 140

tat ttt ggc tgg gtg gcg cag aac ggc ttc gaa ctg cca gag aac cgc 480
Tyr Phe Gly Trp Val Ala Gln Asn Gly Phe Glu Leu Pro Glu Asn Arg
145 150 155 160

gaa gat atc tgg ctg ccg gaa ggt gaa cat tcc gtt ccc ggt gct acc 528
Glu Asp Ile Trp Leu Pro Glu Gly Glu His Ser Val Pro Gly Ala Thr
165 170 175

gac aaa ccg teg cgc att ccg aag gaa ttt teg gat teg aca ttc ttc 576 

Asp Lys Pro Ser Arg lie Pro Lys Glu Phe Ser Asp Ser Thr Phe Phe 
180 185 190 

acg gag cgc gec ctg aca tat ctg aag ggc agg gac ggc aag cct ttc 624 

Thr Glu Arg Ala Leu Thr Tyr Leu Lys Gly Arg Asp Gly Lys Pro Phe 
195 200 205 

ttc ctg cat ctt ggc tat tat cgc ccg cat ccg cct ttc gta gec tec 672 

Phe Leu His Leu Gly Tyr Tyr Arg Pro His Pro Pro Phe val Ala Ser 

210 215 220 

gcg ccc tac cat gcg atg tac aaa gec gaa gat atg cct gcg cct ata 720 

Ala Pro Tyr His Ala Met Tyr Lys Ala Glu Asp Met Pro Ala Pro lie 

225 230 235 240 


cgt gcg gag aat ccg gat gcc gaa gcg gca cag cat ccg ctc atg aag 768
Arg Ala Glu Asn Pro Asp Ala Glu Ala Ala Gln His Pro Leu Met Lys
245 250 255

cac tat atc gac cac atc aga cgc ggc tcg ttc ttc cat ggc gcg gaa 816
His Tyr Ile Asp His Ile Arg Arg Gly Ser Phe Phe His Gly Ala Glu
260 265 270

ggc tcg gga gca acg ctt gat gaa ggc gaa att cgc cag atg cgc gct 864
Gly Ser Gly Ala Thr Leu Asp Glu Gly Glu Ile Arg Gln Met Arg Ala
275 280 285

aca tat tgc gga ctg atc acc gag atc gac gat tgt ctg ggg agg gtc 912
Thr Tyr Cys Gly Leu Ile Thr Glu Ile Asp Asp Cys Leu Gly Arg Val
290 295 300

ttt gcc tat ctc gat gaa acc ggt cag tgg gac gac acg ctg att atc 960
Phe Ala Tyr Leu Asp Glu Thr Gly Gln Trp Asp Asp Thr Leu Ile Ile
305 310 315 320

ttc acg agc gat cat ggc gaa caa ctg ggc gat cat cac ctg ctc ggc 1008
Phe Thr Ser Asp His Gly Glu Gln Leu Gly Asp His His Leu Leu Gly
325 330 335

aag atc ggt tac aat gcc gaa agc ttc cgt att ccc ttg gtc ata aag 1056
Lys Ile Gly Tyr Asn Ala Glu Ser Phe Arg Ile Pro Leu Val Ile Lys
340 345 350

gat gcg gga cag aac cgg cac gcc ggc cag atc gaa gaa ggc ttc tcc 1104
Asp Ala Gly Gln Asn Arg His Ala Gly Gln Ile Glu Glu Gly Phe Ser
355 360 365

gaa agc atc gac gtc atg ccg acc atc ctc gaa tgg ctg ggc ggg gaa 1152
Glu Ser Ile Asp Val Met Pro Thr Ile Leu Glu Trp Leu Gly Gly Glu
370 375 380

acg cct cgc gcc tgc gac ggc cgt tcg ctg ttg ccg ttt ctg gct gag 1200
Thr Pro Arg Ala Cys Asp Gly Arg Ser Leu Leu Pro Phe Leu Ala Glu
385 390 395 400

gga aag ccc tcc gac tgg cgc acg gaa cta cat tac gag ttc gat ttt 1248
Gly Lys Pro Ser Asp Trp Arg Thr Glu Leu His Tyr Glu Phe Asp Phe
405 410 415

cgc gat gtc ttc tac gat cag ccg cag aac tcg gtc cag ctt tcc cag 1296
Arg Asp Val Phe Tyr Asp Gln Pro Gln Asn Ser Val Gln Leu Ser Gln
420 425 430

gat gat tgc agc ctc tgt gtg atc gag gac gaa aac tac aag tac gtg 1344
Asp Asp Cys Ser Leu Cys Val Ile Glu Asp Glu Asn Tyr Lys Tyr Val
435 440 445



cat ttt gcc gcc ctg ccg ccg ctg ttc ttc gat ctg aag gca gac ccg 1392 

His Phe Ala Ala Leu Pro Pro Leu Phe Phe Asp Leu Lys Ala Asp Pro 
450 455 460 

cat gaa ttc agc aat ctg gct ggc gat cct gct tat gcg gcc ctc gtt 1440
His Glu Phe Ser Asn Leu Ala Gly Asp Pro Ala Tyr Ala Ala Leu Val
465 470 475 480

cgt gac tat gcc cag aag gca ttg tcg tgg cga ctg tct cat gcc gac 1488
Arg Asp Tyr Ala Gln Lys Ala Leu Ser Trp Arg Leu Ser His Ala Asp
485 490 495

cgg aca ctc acc cat tac aga tcc agc ccg caa ggg ctg aca acg cgc 1536
Arg Thr Leu Thr His Tyr Arg Ser Ser Pro Gln Gly Leu Thr Thr Arg
500 505 510

aac cat tga 1545
Asn His

<210> 26 
<211> 514 
<212> PRT 

<213> Burkholderia caryophylli 
<400> 26 

Met Thr Arg Lys Asn Val Leu Leu lie Val val Asp Gin Trp Arg Ala 
1 5 10 15 

Asp Phe lie Pro His Leu Met Arg Ala Glu Gly Arg Glu Pro Phe Leu 
20 25 ~ 30 

Lys Thr Pro Asn Leu Asp Arg Leu Cys Arg Glu Gly Leu Thr Phe Arg 
35 40 45 

Asn His Val Thr Thr Cys val Pro Cys Gly Pro Ala Arg Ala Ser Leu 
50 55 60 

Leu Thr Gly Leu Tyr Leu Met Asn His Arg Ala val Gin Asn Thr val 
65 70 75 80 

Pro Leu Asp Gin Arg His Leu Asn Leu Gly Lys Ala Leu Arg Ala lie 
85 90 95 

Gly Tyr Asp Pro Ala Leu lie Gly Tyr Thr Thr Thr Thr Pro Asp Pro 
100 105 110 

Arg Thr Thr Ser Ala Arg Asp Pro Arg Phe Thr val Leu Gly Asp lie 
115 "* 120 125 

Met Asp Gly Phe Arg Ser Val Gly Ala Phe Glu Pro Asn Met Glu Gly 
130 ~ 135 140 

Tyr Phe Gly Trp Val Ala Gin Asn Gly Phe Glu Leu Pro Glu Asn Arg 
145 150 155 160 

Glu Asp lie Trp Leu Pro Glu Gly Glu His Ser Val Pro Gly Ala Thr 
165 170 175 

Asp Lys Pro Ser Arg lie Pro Lys Glu Phe Ser Asp Ser Thr Phe Phe 
180 185 190 

Thr Glu Arg Ala Leu Thr Tyr Leu Lys Gly Arg Asp Gly Lys Pro Phe 
195 200 205 

Phe Leu His Leu Gly Tyr Tyr Arg Pro His Pro Pro Phe val Ala Ser 
210 215 220 

Ala Pro Tyr His Ala Met Tyr Lys Ala Glu Asp Met Pro Ala Pro lie 
225 230 235 240 

Arg Ala Glu Asn Pro Asp Ala Glu Ala Ala Gin His Pro Leu Met Lys 
245 250 255 

His Tyr Ile Asp His Ile Arg Arg Gly Ser Phe Phe His Gly Ala Glu
260 265 270

Gly Ser Gly Ala Thr Leu Asp Glu Gly Glu lie Arg Gin Met Arg Ala 
275 280 285 

Thr Tyr Cys Gly Leu lie Thr Glu lie Asp Asp Cys Leu Gly Arg val 
290 295 300 

Phe Ala Tyr Leu Asp Glu Thr Gly Gin Trp Asp Asp Thr Leu lie lie 
305 310 315 320 

Phe Thr Ser Asp His Gly Glu Gin Leu Gly Asp His His Leu Leu Gly 
325 330 335 

Lys lie Gly Tyr Asn Ala Glu Ser Phe Arg lie Pro Leu Val lie Lys 
340 345 350 

Asp Ala Gly Gin Asn Arg His Ala Gly Gin lie Glu Glu Gly Phe Ser 
355 360 365 

Glu Ser lie Asp Val Met Pro Thr lie Leu Glu Trp Leu Gly Gly Glu 
370 375 380 

Thr Pro Arg Ala Cys Asp Gly Arg ser Leu Leu Pro Phe Leu Ala Glu 
385 390 ~ 395 400 

Gly Lys Pro Ser Asp Trp Arg Thr Glu Leu His Tyr Glu Phe Asp Phe 
405 ~ 410 415 

Arg Asp val Phe Tyr Asp Gin Pro Gin Asn ser val Gin Leu ser Gin 
420 425 430 

Asp Asp Cys ser Leu Cys Val lie Glu Asp Glu Asn Tyr Lys Tyr val 
435 440 445 

His Phe Ala Ala Leu Pro Pro Leu Phe Phe Asp Leu Lys Ala Asp Pro 
450 455 460 

His Glu Phe Ser Asn Leu Ala Gly Asp Pro Ala Tyr Ala Ala Leu Val 
465 470 475 480 

Arg Asp Tyr Ala Gin Lys Ala Leu ser Trp Arg Leu Ser His Ala Asp 
485 490 495 

Arg Thr Leu Thr His Tyr Arg Ser Ser Pro Gin Gly Leu Thr Thr Arg 
500 505 510 

Asn His 



<210> 27 
<211> 2250 
<212> DNA 

<213> Agrobacterium rhizogenes 

<220> 

<221> CDS 

<222> (1)..(2247) 

<223> coding for tryptophan oxygenase (aux1)
<400> 27 

atg get gqa tec tec ttc aca ttg cca tea act gqc tea gcg ccc ctt 
Met Ala Gly Ser Ser Phe Thr Leu Pro Ser Thr Gly Ser Ala Pro Leu 
15 10 15 
9 ggt
j Gly 

20 2'5 30 



35 " 40 45 

ctg acc agg agg get tct cca gac g 

Leu Thr Arg Arg Ala Ser Pro Asp v 
50 ~ 55 

cat gec ctt gcg ttc ata tat gtc a 

His Ala Leu Ala Phe lie Tyr Val A 

65 70 75 80 

aaa aac get cga teg eta acc gaa gcg aac ggc gtc aaa gat ctt 

Lys Asn Ala Arg Ser Leu Thr Glu Ala Asn Gly Val Lys Asp Leu 
85 90 95 

gee ate gac gtt ccg cca ttt cga aac gac ttc tea aga gcg 

Ala lie Asp val Pro Pro Phe Arg Asn Asp Phe Ser Arg Ala 
100 105 110 

ctt caa gtq ate aac ttg ttg gga a 

Leu Gin val lie Asn Leu Leu Gly A 

115 120 12 5 

ctt agt cac ttc ata gca gtt get c 

Leu Ser His Phe lie Ala val Ala L 
130 135 



145 150 155 160 

jqg ttc ccg ate act cgt gaa gqa aax gtq gca xgn gaa gtg eta gee 528 

j\y Phe Pro lie Thr Arg Glu Gly Asn val Ala Cys Glu Val Leu Ala 

165 170 175 



180 185 * 190 



195 200 205 



Asp Gly Arg lie Gly Tyr Phe Pro 
210 215 



Ala lie He Gly Ala Gly Phe Ser c 

225 230 235 240 



245 250 255 

gga aag 

Leu Gly 



ctt gga gga aag eta tgg tea cac gga ttt aag agt get cca aat gtg 816 
Iy Gly Lys Leu Trp Ser His C 



ctg 
Leu 


ctg 
Leu 


caa 
Gin 


aca 
Thr 


ccg 
Pro 


cag 
Gin 


tea 
Ser 


tct 
Ser 


ggc 

Gl V 
60 


Ala 


gag 

Glu 
75 


acg 
Thr 


aac 
Asn 
90 


ggc 
Gly 


Val 


gac 
Asp 


ttc 
Phe 


tea 
Ser 


aac 
Asn 


cga 
Arg 


aat 
Asn 


cca 
Pro 


aac 
Asn 


age 
Ser 
140 


age 
Ser 


ttg 
Leu 
155 


tea 
Ser 


val 
170 


gca 
Ala 


tgt 
Cys 


tgc 

Cvs 


tec 
Ser 


gat 

ASp 


ttc 
Phe 


ttc 
Phe 


gag 

Glu 


ggc 
Gly 


gtt 
val 


acg 
Thr 
220 


etc 
Leu 


gtt 
val 
235 


gca 

A 1 d. 


gtq 
val 
250 


tat 
Tyr 


gag 
Glu 


ttt aag
Phe Lys


agt
Ser



96 



144 



192 



240 



288 



336 



384 



432 



480 



576 



624 



672 



720 



768 



260 265 270 

ata gcc gag atg ggg gcc atg cgt ttt ccg cga agt gaa tca tgc ttg 864
Ile Ala Glu Met Gly Ala Met Arg Phe Pro Arg Ser Glu Ser Cys Leu
275 280 285

ttc ttc tat ctc aaa aag cac gga ctg gac tcc gtt ggt ctg ttc ccg 912
Phe Phe Tyr Leu Lys Lys His Gly Leu Asp Ser Val Gly Leu Phe Pro
290 295 300

aat ccg gga agt gtc gat acc gca ttg ttc tac agg ggc cgt caa tat 960
Asn Pro Gly Ser Val Asp Thr Ala Leu Phe Tyr Arg Gly Arg Gln Tyr
305 310 315 320

atc tgg aaa gcg gga gag gag cca ccg gag ctg ttt cgt cgt gtg cac 1008
Ile Trp Lys Ala Gly Glu Glu Pro Pro Glu Leu Phe Arg Arg Val His
325 330 335

cat gga tgg cgc gca ttt ttg caa gat ggc tat ctc cat gat gga gtc 1056
His Gly Trp Arg Ala Phe Leu Gln Asp Gly Tyr Leu His Asp Gly Val
340 345 350

atg ttg gcg tca ccg tta gca att gtt gac gcc ttg aat tta ggg cat 1104
Met Leu Ala Ser Pro Leu Ala Ile Val Asp Ala Leu Asn Leu Gly His
355 360 365

cta cag cag gcg cat ggc ttc tgg caa tct tgg ctc aca tat ttt gag 1152
Leu Gln Gln Ala His Gly Phe Trp Gln Ser Trp Leu Thr Tyr Phe Glu
370 375 380

cga gag tct ttc tct tct ggc atc gaa aaa atg ttc ttg ggc aat cat 1200
Arg Glu Ser Phe Ser Ser Gly Ile Glu Lys Met Phe Leu Gly Asn His
385 390 395 400

cct ccg ggg ggt gaa caa tgg aat tcc cta gat gac ttg gat ctt ttc 1248
Pro Pro Gly Gly Glu Gln Trp Asn Ser Leu Asp Asp Leu Asp Leu Phe
405 410 415

aaa gcg ctg ggt att gga tcc ggc gga ttc ggc cct gta ttt gaa agt 1296
Lys Ala Leu Gly Ile Gly Ser Gly Gly Phe Gly Pro Val Phe Glu Ser
420 425 430

ggg ttt atc gag atc ctt cgc tta gtc gtc aac ggg tat gag gat aac 1344
Gly Phe Ile Glu Ile Leu Arg Leu Val Val Asn Gly Tyr Glu Asp Asn
435 440 445

gtg cgg ctg agt tac gaa gga att tct gag ctg cct cat agg atc gcc 1392
Val Arg Leu Ser Tyr Glu Gly Ile Ser Glu Leu Pro His Arg Ile Ala
450 455 460

tca cag gta att aac ggc aga tct att cgc gag cgt aca att cac gtt 1440
Ser Gln Val Ile Asn Gly Arg Ser Ile Arg Glu Arg Thr Ile His Val
465 470 475 480

caa gtc gag cag att gat aga gag gag gat aaa ata aat atc aag atc 1488
Gln Val Glu Gln Ile Asp Arg Glu Glu Asp Lys Ile Asn Ile Lys Ile
485 490 495

aaa gga gga aag gtt gag gtc tat gat cga gta ctg gtt aca tcc ggg 1536
Lys Gly Gly Lys Val Glu Val Tyr Asp Arg Val Leu Val Thr Ser Gly
500 505 510

ttt gcg aac atc gaa atg cgc cat ctc ctg aca tca agc aac gca ttc 1584
Phe Ala Asn Ile Glu Met Arg His Leu Leu Thr Ser Ser Asn Ala Phe
515 520 525

ttc cat gca gat gta agc cat gca ata ggg aac agt cat atg act ggt 1632
Phe His Ala Asp Val Ser His Ala Ile Gly Asn Ser His Met Thr Gly
530 535 540

gcg tca aaa ctg ttc ttg ctg act aac gaa aaa ttc tgg cta caa cat 1680
Ala Ser Lys Leu Phe Leu Leu Thr Asn Glu Lys Phe Trp Leu Gln His
545 550 555 560

cat ttg cca tcg tgc ata ctc acc acc ggc gtt gca aag gca gtt tat 1728
His Leu Pro Ser Cys Ile Leu Thr Thr Gly Val Ala Lys Ala Val Tyr
565 570 575

tgc tta gac tat gat ccg cga gat cca agc ggc aaa gga ctg gtg ttg 1776
Cys Leu Asp Tyr Asp Pro Arg Asp Pro Ser Gly Lys Gly Leu Val Leu
580 585 590

ata agc tat act tgg gag gat gac tca cat aag ctc cta gcc gtc ccc 1824
Ile Ser Tyr Thr Trp Glu Asp Asp Ser His Lys Leu Leu Ala Val Pro
595 600 605

gac aaa aga gaa agg ttc gca tcg ctg cag cgc gat att ggg agg gca 1872
Asp Lys Arg Glu Arg Phe Ala Ser Leu Gln Arg Asp Ile Gly Arg Ala
610 615 620

ttc cca gat ttt gcc aag cac cta act cct gca gac ggg aac tat gat 1920
Phe Pro Asp Phe Ala Lys His Leu Thr Pro Ala Asp Gly Asn Tyr Asp
625 630 635 640

gat aat atc gtt caa cat gat tgg ctg act gat ccc cac gct ggc gga 1968
Asp Asn Ile Val Gln His Asp Trp Leu Thr Asp Pro His Ala Gly Gly
645 650 655

gcg ttt aaa ctg aac cgc aga ggc aac gac gta tat tca gaa agg ctt 2016
Ala Phe Lys Leu Asn Arg Arg Gly Asn Asp Val Tyr Ser Glu Arg Leu
660 665 670

ttc ttt cag ccc ttt gac gta atg cat ccc gcg gac gat aag gga ctt 2064
Phe Phe Gln Pro Phe Asp Val Met His Pro Ala Asp Asp Lys Gly Leu
675 680 685

tac ttg gcc ggt tgt agc tgt tcc ttc acc gga ggg tgg gtt cat ggt 2112
Tyr Leu Ala Gly Cys Ser Cys Ser Phe Thr Gly Gly Trp Val His Gly
690 695 700

gcc att cag acc gca tgc aac gct acg tgt gcg atc att tat ggt tcc 2160
Ala Ile Gln Thr Ala Cys Asn Ala Thr Cys Ala Ile Ile Tyr Gly Ser
705 710 715 720

gga cac ctg caa gag cta atc cac tgg cga cac ctc aaa gaa ggt aat 2208
Gly His Leu Gln Glu Leu Ile His Trp Arg His Leu Lys Glu Gly Asn
725 730 735

cca ctg gcg cac gct tgg aag cgg tat agg tat caa gcg tga 2250
Pro Leu Ala His Ala Trp Lys Arg Tyr Arg Tyr Gln Ala
740 745

<210> 28 
<211> 749 
<212> PRT 
<213> Agrobacterium rhizogenes
<400> 28 

Met Ala Gly Ser Ser Phe Thr Leu Pro Ser Thr Gly Ser Ala Pro Leu 
15 10 15 

Asp Met Met Leu lie Asp Asp Ser Asp Leu Leu Gin Leu Gly Leu Gin 
20 25 30 

Gin Val Phe Ser Lys Arg Tyr Thr Glu Thr Pro Gin Ser Arg Tyr Lys 
35 " 40 45 

Leu Thr Arg Arg Ala Ser Pro Asp val Ser Ser Gly Glu Gly Asn Val 
50 " 55 60 

His Ala Leu Ala Phe lie Tyr val Asn Ala Glu Thr Leu Gin Met lie 
65 70 75 80 

Lys Asn Ala Arg Ser Leu Thr Glu Ala Asn Gly val Lys Asp Leu val 
85 90 95 

Ala lie Asp val Pro Pro Phe Arg Asn Asp Phe Ser Arg Ala Leu Leu 
100 105 110 

Leu Gin val lie Asn Leu Leu Gly Asn Asn Arg Asn Ala Asp Asp Asp 
115 120 125 

Leu Ser His Phe lie Ala Val Ala Leu Pro Asn ser Ala Arg Ser Lys 
130 135 140 

Ile Leu Thr Thr Ala Pro Phe Glu Gly Ser Leu Ser Glu Asn Phe Arg
145 150 155 160

Gly Phe Pro lie Thr Arg Glu Gly Asn Val Ala Cys Glu Val Leu Ala 
165 170 175 

Tyr Gly Asn Asn Leu Met Pro Lys Ala Cys Ser Asp Ser Phe Pro Thr 
180 185 190 

Val Asp Leu Leu Tyr Asp Tyr Gly Lys Phe Phe Glu Ser Cys Ala Ala 
195 200 205 

Asp Gly Arg lie Gly Tyr Phe Pro Glu Gly Val Thr Lys Pro Lys Val 
210 215 220 

Ala lie lie Gly Ala Gly Phe Ser Gly Leu Val Ala Ala ser Glu Leu 
225 230 235 240 

Leu His Ala Gly Val Asp Asp val Thr val Tyr Glu Ala Ser Asp Arg 
245 250 255 

Leu Gly Gly Lys Leu Trp Ser His Gly Phe Lys Ser Ala Pro Asn val 
260 265 270 

lie Ala Glu Met Gly Ala Met Arg Phe Pro Arg Ser Glu Ser cys Leu 
275 280 285 

Phe Phe Tyr Leu Lys Lys His Gly Leu Asp Ser val Gly Leu Phe Pro 
290 ' 295 300 

Asn Pro Gly Ser val Asp Thr Ala Leu Phe Tyr Arg Gly Arg Gin Tyr 
305 310 315 320

Ile Trp Lys Ala Gly Glu Glu Pro Pro Glu Leu Phe Arg Arg Val His
325 330 335 

His Gly Trp Arg Ala Phe Leu Gin Asp Gly Tyr Leu His Asp Gly val 
340 345 350 

Met Leu Ala Ser Pro Leu Ala lie val Asp Ala Leu Asn Leu Gly His 
355 360 365 

Leu Gin Gin Ala His Gly Phe Trp Gin ser Trp Leu Thr Tyr Phe Glu 
370 375 380 

Arg Glu Ser Phe Ser Ser Gly lie Glu Lys Met Phe Leu Gly Asn His 
385 390 395 400 

Pro Pro Gly Gly Glu Gin Trp Asn Ser Leu Asp Asp Leu Asp Leu Phe 
405 410 415 

Lys Ala Leu Gly lie Gly Ser Gly Gly Phe Gly Pro val Phe Glu ser 
420 425 430 

Gly Phe lie Glu lie Leu Arg Leu val Val Asn Gly Tyr Glu Asp Asn 
435 440 445 

val Arg Leu Ser Tyr Glu Gly lie Ser Glu Leu Pro His Arg lie Ala 
450 455 460 

ser Gin val lie Asn Gly Arg ser lie Arg Glu Arg Thr lie His Val 
465 470 ~ 475 480 

Gin Val Glu Gin He Asp Arg Glu Glu Asp Lys lie Asn lie Lys lie 
485 490 495 

Lys Gly Gly Lys val Glu val Tyr Asp Arg val Leu val Thr Ser Gly 
500 505 ~ 510 

Phe Ala Asn lie Glu Met Arg His Leu Leu Thr Ser Ser Asn Ala Phe 
515 520 525 

Phe His Ala Asp val Ser His Ala He Gly Asn Ser His Met Thr Gly 
530 535 540 

Ala Ser Lys Leu Phe Leu Leu Thr Asn Glu Lys Phe Trp Leu Gin His 
545 550 555 560 

His Leu Pro Ser Cys lie Leu Thr Thr Gly val Ala Lys Ala val Tyr 
565 570 575 

Cys Leu Asp Tyr Asp Pro Arg Asp Pro Ser Gly Lys Gly Leu val Leu 
580 ~ 585 590 

lie Ser Tyr Thr Trp Glu Asp Asp Ser His Lys Leu Leu Ala Val Pro 
595 600 605 

Asp Lys Arg Glu Arg Phe Ala Ser Leu Gin Arg Asp lie Gly Arg Ala 
610 ~ 615 620 

Phe Pro Asp Phe Ala Lys His Leu Thr Pro Ala Asp Gly Asn Tyr Asp 
625 630 635 640 

Asp Asn lie val Gin His Asp Trp Leu Thr Asp Pro His Ala Gly Gly 
645 650 655

Ala Phe Lys Leu Asn Arg Arg Gly Asn Asp Val Tyr Ser Glu Arg Leu
660 665 670 

Phe Phe Gin Pro Phe Asp Val Met His Pro Ala Asp Asp Lys Gly Leu 
675 680 685 

Tyr Leu Ala Gly Cys Ser Cys Ser Phe Thr Gly Gly Trp Val His Gly 
690 695 700 

Ala lie Gin Thr Ala Cys Asn Ala Thr Cys Ala lie lie Tyr Gly Ser 
705 710 715 720 

Gly His Leu Gin Glu Leu lie His Trp Arg His Leu Lys Glu Gly Asn 
725 730 735 

Pro Leu Ala His Ala Trp Lys Arg Tyr Arg Tyr Gin Ala 
740 745 

<210> 29 
<211> 1401 
<212> DNA 

<213> Agrobacterium rhizogenes 

<220> 

<221> CDS 

<222> (1)..(1398)

<223> coding for indoleacetamide hydrolase 
<400> 29 

atg gtg acc ctc tcc tcg atc acc gag acg ctt aaa tgt ctc agg gaa 48
Met Val Thr Leu Ser Ser Ile Thr Glu Thr Leu Lys Cys Leu Arg Glu
1 5 10 15

aga aaa tac tcg tgc ttt gag tta atc gaa acg ata ata gcc cgc tgt 96
Arg Lys Tyr Ser Cys Phe Glu Leu Ile Glu Thr Ile Ile Ala Arg Cys
20 25 30

gaa gca gca aga tcc tta aac gcc ttt ctg gaa acc gac tgg gcg cac 144
Glu Ala Ala Arg Ser Leu Asn Ala Phe Leu Glu Thr Asp Trp Ala His
35 40 45

cta cgg tgg act gcc agc aaa atc gat caa cac gga ggt gcc ggt gtt 192
Leu Arg Trp Thr Ala Ser Lys Ile Asp Gln His Gly Gly Ala Gly Val
50 55 60



ggc cta gct ggc gtt ccc cta tgc ttt aaa gcg aat att gcg aca ggc 240
Gly Leu Ala Gly Val Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

agg ttc gcc gcg acc gct ggt acg cca ggc tta cag aac cac aaa ccc 288
Arg Phe Ala Ala Thr Ala Gly Thr Pro Gly Leu Gln Asn His Lys Pro
85 90 95

aag acg cct gcc gga gtt gca cga caa ctt ctc gcg gct ggg gca ctg 336
Lys Thr Pro Ala Gly Val Ala Arg Gln Leu Leu Ala Ala Gly Ala Leu
100 105 110

cct ggc gct tcg gga aac atg cac gaa ttg tct ttt ggg atc acg agc 384
Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

aac aac ttc gcc aca ggc gcc gta cga aac ccg tgg aac cct agt ctc 432
Asn Asn Phe Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

atc cca ggg gga tca agt ggg ggt gtg gcc gcc gcg gtg gcc ggc cga 480
Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Gly Arg
145 150 155 160



ttg atg ctg ggc ggc gtc gga act gac acg gga gcg tcg gtc cgt tta 528
Leu Met Leu Gly Gly Val Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

ccg gcc gcc ttg tgc ggc gtg gtg ggg ttt cgt cct acc gtg ggg cga 576
Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Val Gly Arg
180 185 190

tat cca acg gac gga ata gtt ccg gta agc ccc acc cgg gac acc cct 624
Tyr Pro Thr Asp Gly Ile Val Pro Val Ser Pro Thr Arg Asp Thr Pro
195 200 205

ggc gtt atc gca cag aat gtt ccg gac gtg att ctt ctt gac ggt atc 672
Gly Val Ile Ala Gln Asn Val Pro Asp Val Ile Leu Leu Asp Gly Ile
210 215 220

att tgc ggg aga ccg ccg gtt aat caa acg gtc cgc ctg aag ggg ctg 720
Ile Cys Gly Arg Pro Pro Val Asn Gln Thr Val Arg Leu Lys Gly Leu
225 230 235 240

cgt ata ggc ttg cca acc gct tac ttt tac aac gac ctg gag ccc gat 768
Arg Ile Gly Leu Pro Thr Ala Tyr Phe Tyr Asn Asp Leu Glu Pro Asp
245 250 255



gtc gcc tta gca gcc gag acg att atc aga gtt ctg gca cgc aaa gat 816
Val Ala Leu Ala Ala Glu Thr Ile Ile Arg Val Leu Ala Arg Lys Asp
260 265 270

gtt act ttt gtt gaa gca gat att cct gat tta gcg cat cac aat gaa 864
Val Thr Phe Val Glu Ala Asp Ile Pro Asp Leu Ala His His Asn Glu
275 280 285

ggg gtc agc ttt ccg act gcc atc tac gaa ttt ccg ttg tcc ctt gaa 912
Gly Val Ser Phe Pro Thr Ala Ile Tyr Glu Phe Pro Leu Ser Leu Glu
290 295 300

cat tat att cag aac ttc gta gag ggt gtt tcc ttt tct gag gtt gtc 960
His Tyr Ile Gln Asn Phe Val Glu Gly Val Ser Phe Ser Glu Val Val
305 310 315 320

aga gcg att cgc agt ccg gat gtt gca agt att ctc aat gca caa ctc 1008
Arg Ala Ile Arg Ser Pro Asp Val Ala Ser Ile Leu Asn Ala Gln Leu
325 330 335

tcg gat aat ctt att tcc aaa agc gag tat tgt ctg gcg cga cgt ttt 1056
Ser Asp Asn Leu Ile Ser Lys Ser Glu Tyr Cys Leu Ala Arg Arg Phe
340 345 350

ttc aga ccg aga ctc caa gcg gcc tac cac agt tac ttc aag gcg cat 1104
Phe Arg Pro Arg Leu Gln Ala Ala Tyr His Ser Tyr Phe Lys Ala His
355 360 365

cag cta gat gca att ctt ttc cca aca gct ccg ttg aca gcc aag cca 1152
Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Thr Ala Lys Pro
370 375 380

att ggc cat gat cta tcg gtg att cac aat ggc tca atg acc gat acc 1200
Ile Gly His Asp Leu Ser Val Ile His Asn Gly Ser Met Thr Asp Thr
385 390 395 400

ttt aaa atc ttc gtg cgg aat gta gat ccc agc agt aat gcg ggc ctg 1248
Phe Lys Ile Phe Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

ccg ggc cta agt ctt ccc gtt tct ctt agt tcc aac ggt ctg cct att 1296
Pro Gly Leu Ser Leu Pro Val Ser Leu Ser Ser Asn Gly Leu Pro Ile
420 425 430

ggc atg gaa atc gat ggc tct gca agc tcg gat gaa cgt ctg tta gca 1344
Gly Met Glu Ile Asp Gly Ser Ala Ser Ser Asp Glu Arg Leu Leu Ala
435 440 445

att gga cta gcg ata gaa gaa gca ata gac ttt agg cat cgt ccg act 1392
Ile Gly Leu Ala Ile Glu Glu Ala Ile Asp Phe Arg His Arg Pro Thr
450 455 460

ctg tcg taa 1401
Leu Ser
465

<210> 30 
<211> 466 
<212> PRT 

<213> Agrobacterium rhizogenes 
<400> 30 

Met Val Thr Leu Ser Ser lie Thr Glu Thr Leu Lys Cys Leu Arg Glu 
15 10 15 

Arg Lys Tyr Ser Cys Phe Glu Leu lie Glu Thr lie lie Ala Arg Cys 
20 25 30 

Glu Ala Ala Arg Ser Leu Asn Ala Phe Leu Glu Thr Asp Trp Ala His 
35 40 45 

Leu Arg Trp Thr Ala Ser Lys lie Asp Gin His Gly Gly Ala Gly Val 
50 55 60 

Gly Leu Ala Gly Val Pro Leu Cys Phe Lys Ala Asn lie Ala Thr Gly 
65 70 75 80 

Arg Phe Ala Ala Thr Ala Gly Thr Pro Gly Leu Gin Asn His Lys Pro 
85 90 95 

Lys Thr Pro Ala Gly Val Ala Arg Gin Leu Leu Ala Ala Gly Ala Leu 
100 105 110 

Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly lie Thr Ser 
115 120 125 

Asn Asn Phe Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu 
130 135 140 

lie Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Gly Arg 
145 150 ' 155 160 

Leu Met Leu Gly Gly Val Gly Thr Asp Thr Gly Ala Ser Val Arg Leu 
165 170 175 

Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Val Gly Arg
180 185 190

Tyr Pro Thr Asp Gly lie Val Pro val Ser Pro Thr Arg Asp Thr Pro 
195 200 205 

Gly val lie Ala Gin Asn val Pro Asp val lie Leu Leu Asp Gly lie 
210 215 220 

lie Cys Gly Arg Pro Pro val Asn Gin Thr val Arg Leu Lys Gly Leu 
225 230 235 240 

Arg lie Gly Leu Pro Thr Ala Tyr Phe Tyr Asn Asp Leu Glu Pro Asp 
245 250 255 

Val Ala Leu Ala Ala Glu Thr lie He Arg val Leu Ala Arg Lys Asp 
260 265 ~ 270 

val Thr Phe Val Glu Ala Asp lie Pro Asp Leu Ala His His Asn Glu 
275 280 285 

Gly val ser Phe Pro Thr Ala lie Tyr Glu Phe Pro Leu Ser Leu Glu 
290 295 300 

His Tyr lie Gin Asn Phe Val Glu Gly val Ser Phe Ser Glu Val val 
305 310 315 320 

Arg Ala lie Arg Ser Pro Asp Val Ala Ser lie Leu Asn Ala Gin Leu 
325 330 335 

Ser Asp Asn Leu lie Ser Lys Ser Glu Tyr Cys Leu Ala Arg Arg Phe 
340 345 350 

Phe Arg Pro Arg Leu Gin Ala Ala Tyr His Ser Tyr Phe Lys Ala His 
355 360 365 

Gin Leu Asp Ala lie Leu Phe Pro Thr Ala Pro Leu Thr Ala Lys Pro 
370 375 380 

lie Gly His Asp Leu Ser val lie His Asn Gly Ser Met Thr Asp Thr 
385 * 390 395 400 

Phe Lys lie Phe Val Arg Asn val Asp Pro Ser Ser Asn Ala Gly Leu 
405 410 415 

Pro Gly Leu ser Leu Pro Val Ser Leu Ser Ser Asn Gly Leu Pro lie 
420 425 430 

Gly Met Glu lie Asp Gly Ser Ala Ser Ser Asp Glu Arg Leu Leu Ala 
435 440 445 

lie Gly Leu Ala lie Glu Glu Ala lie Asp Phe Arg His Arg Pro Thr 
450 455 460 

Leu Ser 
465 

<210> 31 
<211> 2268 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 
<222> (1)..(2265)

<223> coding for tryptophan monooxygenase 
<400> 31 

atg tca gct tca cct ctc ctt gat aac cag tgc gat cat ttc tct acc 48
Met Ser Ala Ser Pro Leu Leu Asp Asn Gln Cys Asp His Phe Ser Thr
1 5 10 15

aaa atg gtg gat ctg ata atg gtc gat aag gct gat gaa ttg gac cgc 96
Lys Met Val Asp Leu Ile Met Val Asp Lys Ala Asp Glu Leu Asp Arg
20 25 30

agg gtt tcc gat gcc ttc tca gaa cgt gaa gct tct agg gga agg agg 144
Arg Val Ser Asp Ala Phe Ser Glu Arg Glu Ala Ser Arg Gly Arg Arg
35 40 45

att act caa atc tcc ggc gag tgc agc gct ggg tta gct tgc aaa agg 192
Ile Thr Gln Ile Ser Gly Glu Cys Ser Ala Gly Leu Ala Cys Lys Arg
50 55 60

ctg gcc gac ggt cgc ttt ccc gag atc tca act ggt gag aag gta gca 240
Leu Ala Asp Gly Arg Phe Pro Glu Ile Ser Thr Gly Glu Lys Val Ala
65 70 75 80

gcc ctc tcc gct tac atc tat gtt ggc aag gaa att ctg ggg cgg ata 288
Ala Leu Ser Ala Tyr Ile Tyr Val Gly Lys Glu Ile Leu Gly Arg Ile
85 90 95

ctt gaa tcg gaa cct tgg gcg cga gca aga gtg agt ggt ctc gtt gcc 336
Leu Glu Ser Glu Pro Trp Ala Arg Ala Arg Val Ser Gly Leu Val Ala
100 105 110

atc gac ctt gca cca ttt tgt atg gat ttc tcc gaa gca caa ctt ctc 384
Ile Asp Leu Ala Pro Phe Cys Met Asp Phe Ser Glu Ala Gln Leu Leu
115 120 125

caa acc ctg ttt ttg ctg agc ggt aaa aga tgt gca tcc agc gat ctt 432
Gln Thr Leu Phe Leu Leu Ser Gly Lys Arg Cys Ala Ser Ser Asp Leu
130 135 140

agt cat ttc gtg gcc att tca atc tct aag act gcc cgc tcc cga acc 480
Ser His Phe Val Ala Ile Ser Ile Ser Lys Thr Ala Arg Ser Arg Thr
145 150 155 160

ctg caa atg ccg ccg tac gag aaa ggc acg acg aaa cgc gtt acc ggg 528
Leu Gln Met Pro Pro Tyr Glu Lys Gly Thr Thr Lys Arg Val Thr Gly
165 170 175

ttt acc ctg acc ctt gaa gag gcc gta cca ttt gac atg gta gct tat 576
Phe Thr Leu Thr Leu Glu Glu Ala Val Pro Phe Asp Met Val Ala Tyr
180 185 190

ggt cga aac ctg atg ctg aag gct tcg gca ggt tcc ttt cca aca att 624
Gly Arg Asn Leu Met Leu Lys Ala Ser Ala Gly Ser Phe Pro Thr Ile
195 200 205

gac ttg ctc tat gac tac aga tcg ttt ttt gac caa tgt tcc gat att 672
Asp Leu Leu Tyr Asp Tyr Arg Ser Phe Phe Asp Gln Cys Ser Asp Ile
210 215 220

gga cgg atc ggc ttc ttt ccg gaa gat gtt cct aag ccg aaa gtg gcg 720
Gly Arg Ile Gly Phe Phe Pro Glu Asp Val Pro Lys Pro Lys Val Ala
225 230 235 240




atc att ggc gct ggc att tcc gga ctc gtg gta gca agc gaa ctg ctt 768
Ile Ile Gly Ala Gly Ile Ser Gly Leu Val Val Ala Ser Glu Leu Leu
245 250 255

cat gct ggt gta gac gat gtt aca ata tat gaa gca agt gat cgg gtt 816
His Ala Gly Val Asp Asp Val Thr Ile Tyr Glu Ala Ser Asp Arg Val
260 265 270

gga ggc aag ctt tgg tca cat gct ttc aag gat gct ccc agc gtg gtg 864
Gly Gly Lys Leu Trp Ser His Ala Phe Lys Asp Ala Pro Ser Val Val
275 280 285

gcc gaa atg ggg gcg atg cga ttt cct cct gct gca tcg tgc ttg ttt 912
Ala Glu Met Gly Ala Met Arg Phe Pro Pro Ala Ala Ser Cys Leu Phe
290 295 300

ttc ttc ctc gag cgg tac ggc ctg tct tcg atg agg ccg ttc cca aat 960
Phe Phe Leu Glu Arg Tyr Gly Leu Ser Ser Met Arg Pro Phe Pro Asn
305 310 315 320

ccc ggc aca gtc gac act aac ttg gtc tac caa ggc ctc cga tac gtg 1008
Pro Gly Thr Val Asp Thr Asn Leu Val Tyr Gln Gly Leu Arg Tyr Val
325 330 335

tgg aaa gcc ggg cag cag cca ccg aag ctg ttc cat cgc gtt tac agc 1056
Trp Lys Ala Gly Gln Gln Pro Pro Lys Leu Phe His Arg Val Tyr Ser
340 345 350

ggt tgg cgt gcg ttc ttg agg gac ggt ttc cat gag gga gat att gtg 1104
Gly Trp Arg Ala Phe Leu Arg Asp Gly Phe His Glu Gly Asp Ile Val
355 360 365

ttg gct tcg cct gtt gtt att act caa gcc ttg aaa tca gga gac att 1152
Leu Ala Ser Pro Val Val Ile Thr Gln Ala Leu Lys Ser Gly Asp Ile
370 375 380

agg cgg gct cat gac tcc tgg caa act tgg ctg aac cgt ttc ggg agg 1200
Arg Arg Ala His Asp Ser Trp Gln Thr Trp Leu Asn Arg Phe Gly Arg
385 390 395 400

gag tcc ttc tct tca gcg ata gag agg atc ttt ctg ggc acg cat cct 1248
Glu Ser Phe Ser Ser Ala Ile Glu Arg Ile Phe Leu Gly Thr His Pro
405 410 415

cct ggt ggt gaa aca tgg agt ttc cct cat gat tgg gac cta ttc aag 1296
Pro Gly Gly Glu Thr Trp Ser Phe Pro His Asp Trp Asp Leu Phe Lys
420 425 430

cta atg gga ata gga tct ggc ggg ttt ggt cca gtt ttt gaa agc ggg 1344
Leu Met Gly Ile Gly Ser Gly Gly Phe Gly Pro Val Phe Glu Ser Gly
435 440 445

ttt att gag atc ctt cgc ttg gtc ata aac gga tat gaa gaa aat cag 1392
Phe Ile Glu Ile Leu Arg Leu Val Ile Asn Gly Tyr Glu Glu Asn Gln
450 455 460

cgg atg tgc tct gaa gga atc tca gaa ctt cca cgt cga ata gcc tct 1440
Arg Met Cys Ser Glu Gly Ile Ser Glu Leu Pro Arg Arg Ile Ala Ser
465 470 475 480

caa gtg gtt aac ggt gtg tct gta agc cag cgt ata cgc cat gtt caa 1488
Gln Val Val Asn Gly Val Ser Val Ser Gln Arg Ile Arg His Val Gln
485 490 495

gtc agg gcg att gag aag gaa aag aca aaa ata aag ata agg ctt aag 1536
Val Arg Ala Ile Glu Lys Glu Lys Thr Lys Ile Lys Ile Arg Leu Lys
500 505 510

agc ggg ata tct gaa ctt tat gat aag gtg gtg gtt aca tct gga ctc 1584
Ser Gly Ile Ser Glu Leu Tyr Asp Lys Val Val Val Thr Ser Gly Leu
515 520 525

gca aat atc caa ctc agg cat tgt ctg aca tgc gat acc acc att ttt 1632
Ala Asn Ile Gln Leu Arg His Cys Leu Thr Cys Asp Thr Thr Ile Phe
530 535 540



cgt gca cca gtg aac caa gcg gtt gat aac agc cat atg aca ggc tcg 1680
Arg Ala Pro Val Asn Gln Ala Val Asp Asn Ser His Met Thr Gly Ser
545 550 555 560

tca aaa ctc ttt ctg ctg act gaa cga aaa ttt tgg tta gac cat atc 1728
Ser Lys Leu Phe Leu Leu Thr Glu Arg Lys Phe Trp Leu Asp His Ile
565 570 575

ctc ccg tcc tgt gtc ctc atg gac ggg atc gca aaa gca gtg tac tgc 1776
Leu Pro Ser Cys Val Leu Met Asp Gly Ile Ala Lys Ala Val Tyr Cys
580 585 590

ttg gac tat gag ccg cag gat ccg aat ggt aaa ggt ctg gtg ccc ccc 1824
Leu Asp Tyr Glu Pro Gln Asp Pro Asn Gly Lys Gly Leu Val Pro Pro
595 600 605

act tat aca tgg gag gac gac tcc cac aag ctg ttg gcg gtt ccc gac 1872
Thr Tyr Thr Trp Glu Asp Asp Ser His Lys Leu Leu Ala Val Pro Asp
610 615 620

aaa aaa gag cga ttc tgt ctg ctg cgg gac gca att tcg aga tct ttc 1920
Lys Lys Glu Arg Phe Cys Leu Leu Arg Asp Ala Ile Ser Arg Ser Phe
625 630 635 640

ccg gcg ttt gcc cag cat cta gtt cct gcc tgc gct gat tac gac caa 1968
Pro Ala Phe Ala Gln His Leu Val Pro Ala Cys Ala Asp Tyr Asp Gln
645 650 655

aat gtt gtt caa cat gat tgg ctt aca gac gag aat gcc ggg gga gct 2016
Asn Val Val Gln His Asp Trp Leu Thr Asp Glu Asn Ala Gly Gly Ala
660 665 670

ttc aaa ctc aac cgg cgt ggc gag gat ttt tat tct gaa gaa ctt ttc 2064
Phe Lys Leu Asn Arg Arg Gly Glu Asp Phe Tyr Ser Glu Glu Leu Phe
675 680 685

ttt caa gcg ctg gac atg cct aat gat acc gga gtt tac ttg gcg ggt 2112
Phe Gln Ala Leu Asp Met Pro Asn Asp Thr Gly Val Tyr Leu Ala Gly
690 695 700

tgc agt tgt tcc ttc acc ggt gga tgg gtg gag ggc gct att cag acc 2160
Cys Ser Cys Ser Phe Thr Gly Gly Trp Val Glu Gly Ala Ile Gln Thr
705 710 715 720

gcg tgt aac gcc gtc tgt gca att atc cac aat tgt gga ggt att ttg 2208
Ala Cys Asn Ala Val Cys Ala Ile Ile His Asn Cys Gly Gly Ile Leu
725 730 735

gca aag gac aat cct ctc gaa cac tct tgg aag aga tat aac tac cgc 2256
Ala Lys Asp Asn Pro Leu Glu His Ser Trp Lys Arg Tyr Asn Tyr Arg
740 745 750

aat aga aat taa 2268
Asn Arg Asn
755



<210> 32 
<211> 755 
<212> PRT 

<213> Agrobacterium tumefaciens 

<400> 32

Met Ser Ala Ser Pro Leu Leu Asp Asn Gln Cys Asp His Phe Ser Thr
1 5 10 15

Lys Met Val Asp Leu Ile Met Val Asp Lys Ala Asp Glu Leu Asp Arg
20 25 30

Arg Val Ser Asp Ala Phe Ser Glu Arg Glu Ala Ser Arg Gly Arg Arg
35 40 45

Ile Thr Gln Ile Ser Gly Glu Cys Ser Ala Gly Leu Ala Cys Lys Arg
50 55 60

Leu Ala Asp Gly Arg Phe Pro Glu Ile Ser Thr Gly Glu Lys Val Ala
65 70 75 80

Ala Leu Ser Ala Tyr Ile Tyr Val Gly Lys Glu Ile Leu Gly Arg Ile
85 90 95

Leu Glu Ser Glu Pro Trp Ala Arg Ala Arg Val Ser Gly Leu Val Ala
100 105 110

Ile Asp Leu Ala Pro Phe Cys Met Asp Phe Ser Glu Ala Gln Leu Leu
115 120 125

Gln Thr Leu Phe Leu Leu Ser Gly Lys Arg Cys Ala Ser Ser Asp Leu
130 135 140

Ser His Phe Val Ala Ile Ser Ile Ser Lys Thr Ala Arg Ser Arg Thr
145 150 155 160

Leu Gln Met Pro Pro Tyr Glu Lys Gly Thr Thr Lys Arg Val Thr Gly
165 170 175

Phe Thr Leu Thr Leu Glu Glu Ala Val Pro Phe Asp Met Val Ala Tyr
180 185 190

Gly Arg Asn Leu Met Leu Lys Ala Ser Ala Gly Ser Phe Pro Thr Ile
195 200 205

Asp Leu Leu Tyr Asp Tyr Arg Ser Phe Phe Asp Gln Cys Ser Asp Ile
210 215 220

Gly Arg Ile Gly Phe Phe Pro Glu Asp Val Pro Lys Pro Lys Val Ala
225 230 235 240

Ile Ile Gly Ala Gly Ile Ser Gly Leu Val Val Ala Ser Glu Leu Leu
245 250 255

His Ala Gly Val Asp Asp Val Thr Ile Tyr Glu Ala Ser Asp Arg Val
260 265 270

Gly Gly Lys Leu Trp Ser His Ala Phe Lys Asp Ala Pro Ser Val Val
275 280 285

Ala Glu Met Gly Ala Met Arg Phe Pro Pro Ala Ala Ser Cys Leu Phe
290 295 300

Phe Phe Leu Glu Arg Tyr Gly Leu Ser Ser Met Arg Pro Phe Pro Asn
305 310 315 320

Pro Gly Thr Val Asp Thr Asn Leu Val Tyr Gln Gly Leu Arg Tyr Val
325 330 335

Trp Lys Ala Gly Gln Gln Pro Pro Lys Leu Phe His Arg Val Tyr Ser
340 345 350

Gly Trp Arg Ala Phe Leu Arg Asp Gly Phe His Glu Gly Asp Ile Val
355 360 365

Leu Ala Ser Pro Val Val Ile Thr Gln Ala Leu Lys Ser Gly Asp Ile
370 375 380

Arg Arg Ala His Asp Ser Trp Gln Thr Trp Leu Asn Arg Phe Gly Arg
385 390 395 400

Glu Ser Phe Ser Ser Ala Ile Glu Arg Ile Phe Leu Gly Thr His Pro
405 410 415

Pro Gly Gly Glu Thr Trp Ser Phe Pro His Asp Trp Asp Leu Phe Lys
420 425 430

Leu Met Gly Ile Gly Ser Gly Gly Phe Gly Pro Val Phe Glu Ser Gly
435 440 445

Phe Ile Glu Ile Leu Arg Leu Val Ile Asn Gly Tyr Glu Glu Asn Gln
450 455 460

Arg Met Cys Ser Glu Gly Ile Ser Glu Leu Pro Arg Arg Ile Ala Ser
465 470 475 480

Gln Val Val Asn Gly Val Ser Val Ser Gln Arg Ile Arg His Val Gln
485 490 495

Val Arg Ala Ile Glu Lys Glu Lys Thr Lys Ile Lys Ile Arg Leu Lys
500 505 510

Ser Gly Ile Ser Glu Leu Tyr Asp Lys Val Val Val Thr Ser Gly Leu
515 520 525

Ala Asn Ile Gln Leu Arg His Cys Leu Thr Cys Asp Thr Thr Ile Phe
530 535 540

Arg Ala Pro Val Asn Gln Ala Val Asp Asn Ser His Met Thr Gly Ser
545 550 555 560

Ser Lys Leu Phe Leu Leu Thr Glu Arg Lys Phe Trp Leu Asp His Ile
565 570 575

Leu Pro Ser Cys Val Leu Met Asp Gly Ile Ala Lys Ala Val Tyr Cys
580 585 590

Leu Asp Tyr Glu Pro Gln Asp Pro Asn Gly Lys Gly Leu Val Pro Pro
595 600 605

Thr Tyr Thr Trp Glu Asp Asp Ser His Lys Leu Leu Ala Val Pro Asp
610 615 620

Lys Lys Glu Arg Phe Cys Leu Leu Arg Asp Ala Ile Ser Arg Ser Phe
625 630 635 640

Pro Ala Phe Ala Gln His Leu Val Pro Ala Cys Ala Asp Tyr Asp Gln
645 650 655

Asn Val Val Gln His Asp Trp Leu Thr Asp Glu Asn Ala Gly Gly Ala
660 665 670

Phe Lys Leu Asn Arg Arg Gly Glu Asp Phe Tyr Ser Glu Glu Leu Phe
675 680 685

Phe Gln Ala Leu Asp Met Pro Asn Asp Thr Gly Val Tyr Leu Ala Gly
690 695 700

Cys Ser Cys Ser Phe Thr Gly Gly Trp Val Glu Gly Ala Ile Gln Thr
705 710 715 720

Ala Cys Asn Ala Val Cys Ala Ile Ile His Asn Cys Gly Gly Ile Leu
725 730 735

Ala Lys Asp Asn Pro Leu Glu His Ser Trp Lys Arg Tyr Asn Tyr Arg
740 745 750

Asn Arg Asn
755



<210> 33 
<211> 1404 
<212> DNA 

<213> Agrobacterium tumefaciens 

<220> 

<221> CDS 

<222> (1)..(1401) 

<223> coding for indoleacetamide hydrolase 
<400> 33 

atg gtg ccc att acc tcg tta gca caa acc cta gaa cgc ctg aga cgg 48
Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

aaa gac tac tcc tgc tta gaa cta gta gaa act ctg ata gcg cgt tgc 96
Lys Asp Tyr Ser Cys Leu Glu Leu Val Glu Thr Leu Ile Ala Arg Cys
20 25 30

caa gct gca aaa cca tta aat gcc ctt ctg gct aca gac tgg gat ggc 144
Gln Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly
35 40 45

ttg cgg cga agc gcc aaa aaa aat gat cgt cat gga aac gcc gga tta 192
Leu Arg Arg Ser Ala Lys Lys Asn Asp Arg His Gly Asn Ala Gly Leu
50 55 60

ggt ctt tgc ggc att cca ctc tgt ttt aag gcg aac atc gcg acc ggc 240
Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

gta ttt cct aca agc gct gct act ccg gcg ctg ata aac cac ttg cca 288
Val Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu Ile Asn His Leu Pro
85 90 95

aag ata cca tcc cgc gtc gca gaa aga ctt ttt tca gct gga gca ctg 336
Lys Ile Pro Ser Arg Val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu
100 105 110

ccg ggt gcc tcg gga aac atg cat gag tta tcg ttt gga att acg agc 384
Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

aac aac tat gcc acc ggt gcg gtg cgg aac ccg tgg aat cca agt ctg 432
Asn Asn Tyr Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

ata cca gag ggt tca agc ggt ggt gtg gct gct gcg gtg gca agc cga 480
Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

ttg atg tta ggc ggc ata ggc acg gat acc ggt gca tct gtt cgc cta 528
Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

... ttt cga ccg acg ctt ggt cga 576
Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Leu Gly Arg
180 185 190

... agc ccc acc cgg gac acc gcc 624
Tyr Pro Arg Asp Arg Ile Ile Pro Phe Ser Pro Thr Arg Asp Thr Ala
195 200 205

... gtt ata atc ctc gac cag gtg 672
Gly Ile Ile Ala Gln Cys Val Ala Asp Val Ile Ile Leu Asp Gln Val
210 215 220

att tcc gga cgg tcg gcg aaa ... ccc atg ccg ctg aag ggg ctt 720
Ile Ser Gly Arg Ser Ala Lys Ile Ser Pro Met Pro Leu Lys Gly Leu
225 230 235 240

... tac gat gac ctt gat gct gat 768
Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

... cgc ttg cta gcc aac aga ggc 816
Val Ala Phe Ala Ala Glu Thr Thr Ile Arg Leu Leu Ala Asn Arg Gly
260 265 270

gta acc ttt gtt gaa gcc gac atc ccc cac cta gag gaa ttg aac agt 864
Val Thr Phe Val Glu Ala Asp Ile Pro His Leu Glu Glu Leu Asn Ser
275 280 285

ggg gca agt ttg cca att gcg ctt tac gaa ttt cca cac gct cta aaa 912
Gly Ala Ser Leu Pro Ile Ala Leu Tyr Glu Phe Pro His Ala Leu Lys
290 295 300

aag tat ctc gac gat ttt gtg gga aca gtt tct ttt tct gac gtt atc 960
Lys Tyr Leu Asp Asp Phe Val Gly Thr Val Ser Phe Ser Asp Val Ile
305 310 315 320

aaa gga att cgt agc ccc gat gta gcg aac att gtc agt gcg caa att 1008
Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Val Ser Ala Gln Ile
325 330 335

... tat gaa ctg gcg cgt caa tcc 1056
Asp Gly His Gln Ile Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gln Ser
340 345 350

ttc agg cca agg ctc cag gcc act tat cgg aat tac ttc aga ctc tat 1104
Phe Arg Pro Arg Leu Gln Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr
355 360 365

cag tta gat gca atc ctt ttc cca act gca ccc tta gcg gcc aaa gcc 1152
Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala
370 375 380

ata ggt cag gag tcg tca gtc atc cac aat ggc tca atg atg aac act 1200
Ile Gly Gln Glu Ser Ser Val Ile His Asn Gly Ser Met Met Asn Thr
385 390 395 400

ttc aag atc tac gtg cga aat gtg gac cca agc agc aac gca ggc cta 1248
Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

cct ggg ttg agc ctt cct gcc tgc ctt aca cct gat cgc ttg cct gtt 1296
Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro Val
420 425 430

gga atg gaa att gat gga tta gcg ggg tca gac cac cgt ctg tta gca 1344
Gly Met Glu Ile Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala
435 440 445

atc ggg gca gca tta gaa aaa gct ata aat ttt tct tcc ttt ccc gat 1392
Ile Gly Ala Ala Leu Glu Lys Ala Ile Asn Phe Ser Ser Phe Pro Asp
450 455 460

gct ttt aat tag 1404
Ala Phe Asn
465

<210> 34 
<211> 467 
<212> PRT 

<213> Agrobacterium tumefaciens 
<400> 34 

Met Val Pro Ile Thr Ser Leu Ala Gln Thr Leu Glu Arg Leu Arg Arg
1 5 10 15

Lys Asp Tyr Ser Cys Leu Glu Leu Val Glu Thr Leu Ile Ala Arg Cys
20 25 30

Gln Ala Ala Lys Pro Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Gly
35 40 45

Leu Arg Arg Ser Ala Lys Lys Asn Asp Arg His Gly Asn Ala Gly Leu
50 55 60

Gly Leu Cys Gly Ile Pro Leu Cys Phe Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

Val Phe Pro Thr Ser Ala Ala Thr Pro Ala Leu Ile Asn His Leu Pro
85 90 95

Lys Ile Pro Ser Arg Val Ala Glu Arg Leu Phe Ser Ala Gly Ala Leu
100 105 110

Pro Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

Asn Asn Tyr Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

Pro Ala Ala Leu Cys Gly Val Val Gly Phe Arg Pro Thr Leu Gly Arg
180 185 190

Tyr Pro Arg Asp Arg Ile Ile Pro Phe Ser Pro Thr Arg Asp Thr Ala
195 200 205

Gly Ile Ile Ala Gln Cys Val Ala Asp Val Ile Ile Leu Asp Gln Val
210 215 220

Ile Ser Gly Arg Ser Ala Lys Ile Ser Pro Met Pro Leu Lys Gly Leu
225 230 235 240

Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

Val Ala Phe Ala Ala Glu Thr Thr Ile Arg Leu Leu Ala Asn Arg Gly
260 265 270

Val Thr Phe Val Glu Ala Asp Ile Pro His Leu Glu Glu Leu Asn Ser
275 280 285

Gly Ala Ser Leu Pro Ile Ala Leu Tyr Glu Phe Pro His Ala Leu Lys
290 295 300

Lys Tyr Leu Asp Asp Phe Val Gly Thr Val Ser Phe Ser Asp Val Ile
305 310 315 320

Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Val Ser Ala Gln Ile
325 330 335

Asp Gly His Gln Ile Ser Asn Asp Glu Tyr Glu Leu Ala Arg Gln Ser
340 345 350

Phe Arg Pro Arg Leu Gln Ala Thr Tyr Arg Asn Tyr Phe Arg Leu Tyr
355 360 365

Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Ala Ala Lys Ala
370 375 380

Ile Gly Gln Glu Ser Ser Val Ile His Asn Gly Ser Met Met Asn Thr
385 390 395 400

Phe Lys Ile Tyr Val Arg Asn Val Asp Pro Ser Ser Asn Ala Gly Leu
405 410 415

Pro Gly Leu Ser Leu Pro Ala Cys Leu Thr Pro Asp Arg Leu Pro Val
420 425 430

Gly Met Glu Ile Asp Gly Leu Ala Gly Ser Asp His Arg Leu Leu Ala
435 440 445

Ile Gly Ala Ala Leu Glu Lys Ala Ile Asn Phe Ser Ser Phe Pro Asp
450 455 460

Ala Phe Asn
465

<210> 35 
<211> 1419 
<212> DNA 

<213> Agrobacterium vitis 

<220> 

<221> CDS 

<222> (1)..(1416) 

<223> coding for indoleacetamide hydrolase 
<400> 35 

atg gtg acc cta ggt tca atc aag gaa acc ctg gaa tgt ctc agg ctg 48
Met Val Thr Leu Gly Ser Ile Lys Glu Thr Leu Glu Cys Leu Arg Leu
1 5 10 15

aaa aaa tac tcc tgt tcc gaa ctg gct gaa acc ata ata gcc cgt tgc 96
Lys Lys Tyr Ser Cys Ser Glu Leu Ala Glu Thr Ile Ile Ala Arg Cys
20 25 30

gaa gcc gcg aaa tct ctc aat gct ctt ctg gcg act gac tgg gat tac 144
Glu Ala Ala Lys Ser Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Tyr
35 40 45

ctg cgg cgt aat gcc aag aaa gta gat gaa gat gga agc gcc ggc gag 192
Leu Arg Arg Asn Ala Lys Lys Val Asp Glu Asp Gly Ser Ala Gly Glu
50 55 60

ggt ctt gcc ggc atc ccg ctg tgt tct aaa gcg aac att gca aca ggc 240
Gly Leu Ala Gly Ile Pro Leu Cys Ser Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

ata ttc cca gca agc gcg gcc acg ccg gcg ctt gat gaa cat tta cct 288
Ile Phe Pro Ala Ser Ala Ala Thr Pro Ala Leu Asp Glu His Leu Pro
85 90 95

aca aca cca gcc ggc gtc cgt aaa ccg ctt cta gac gct ggg gca ctg 336
Thr Thr Pro Ala Gly Val Arg Lys Pro Leu Leu Asp Ala Gly Ala Leu
100 105 110

ata ggc gct tcg gga aac atg cat gag tta tcg ttt ggc att acc agt 384
Ile Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

aac aac cac gcc act ggt gcg gtg aga aac ccc tgg aat ccc agc tta 432
Asn Asn His Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

ata cca gga ggc tcg agc ggc ggc gtg gct gct gct gta gca tca cgg 480
Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

tta atg ctc ggc gga att ggc acc gac acg ggg gct tcg gtc cgc cta 528
Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

cct gca tcc cta tgt ggc gta gtg gga ttc cgc ccg acg atc ggc aga 576
Pro Ala Ser Leu Cys Gly Val Val Gly Phe Arg Pro Thr Ile Gly Arg
180 185 190

tat cct gga gac cga att gtg ccg gtt agc ccc acc cgc gat aca gcc 624
Tyr Pro Gly Asp Arg Ile Val Pro Val Ser Pro Thr Arg Asp Thr Ala
195 200 205

gga att atc gca cag agc gtt cct gat gtg ata ctc ctt gac caa atc 672
Gly Ile Ile Ala Gln Ser Val Pro Asp Val Ile Leu Leu Asp Gln Ile
210 215 220

att tgc ggg aag ctc acg acc cac caa cct gta ccc ctg gag gga tta 720
Ile Cys Gly Lys Leu Thr Thr His Gln Pro Val Pro Leu Glu Gly Leu
225 230 235 240

cgt atc ggc ttg cca acc act tac ttt tac gat gac ctt gat gct gat 768
Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

gtg gcc ttc gca gct gaa aac ctt atc acg ctg ctg gcc agc aag ggt 816
Val Ala Phe Ala Ala Glu Asn Leu Ile Thr Leu Leu Ala Ser Lys Gly
260 265 270

gta acc ttt gtt aag gcc gag att cca gat ctg cag cgt ctg aac atc 864
Val Thr Phe Val Lys Ala Glu Ile Pro Asp Leu Gln Arg Leu Asn Ile
275 280 285

ggg gtt agc ttt cct att gcc ctg tac gag ttt ccg ttc gcc cta caa 912
Gly Val Ser Phe Pro Ile Ala Leu Tyr Glu Phe Pro Phe Ala Leu Gln
290 295 300

aag tat atc gat gac ttt gtg aag gat gtg tct ttt tct gac gtc atc 960
Lys Tyr Ile Asp Asp Phe Val Lys Asp Val Ser Phe Ser Asp Val Ile
305 310 315 320

aaa gga att cgt agc cct gat gta gcc aac att gcc aat gct caa att 1008
Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Ala Asn Ala Gln Ile
325 330 335

gat gga cat caa att tcc aaa gct tca tat gaa ctg gcg cga caa tct 1056
Asp Gly His Gln Ile Ser Lys Ala Ser Tyr Glu Leu Ala Arg Gln Ser
340 345 350

ttc aga cca aag ctg caa gcc gcc tac cat gat tac ttc aag ctg cac 1104
Phe Arg Pro Lys Leu Gln Ala Ala Tyr His Asp Tyr Phe Lys Leu His
355 360 365

cag cta gac gcg atc ctt ttc ccg aca gct ccc ctg aca gcc aaa ccg 1152
Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Thr Ala Lys Pro
370 375 380

atc ggc caa gat tta tcg gtg atg cac aat ggc gta atg gcc gac acg 1200
Ile Gly Gln Asp Leu Ser Val Met His Asn Gly Val Met Ala Asp Thr
385 390 395 400

ttt aaa atc ttc gtg cga aat gtg gat ccg ggg agc aac gca ggc ctg 1248
Phe Lys Ile Phe Val Arg Asn Val Asp Pro Gly Ser Asn Ala Gly Leu
405 410 415

cca gga tta agc ctt ccc gtt tct ctt act tca aag ggt ttg cct att 1296
Pro Gly Leu Ser Leu Pro Val Ser Leu Thr Ser Lys Gly Leu Pro Ile
420 425 430

gga atg gaa atc gat gga tta gcg ggc atg gac gac cgt ttg cta gca 1344
Gly Met Glu Ile Asp Gly Leu Ala Gly Met Asp Asp Arg Leu Leu Ala
435 440 445

atc gga gcg gca cta gag gaa gcg ata gct ttt cat aat tta cct gac 1392
Ile Gly Ala Ala Leu Glu Glu Ala Ile Ala Phe His Asn Leu Pro Asp
450 455 460

ttc ccg aaa gtc gag aca aac tac tga 1419
Phe Pro Lys Val Glu Thr Asn Tyr
465 470

<210> 36 
<211> 472 
<212> PRT 

<213> Agrobacterium vitis 
<400> 36 

Met Val Thr Leu Gly Ser Ile Lys Glu Thr Leu Glu Cys Leu Arg Leu
1 5 10 15

Lys Lys Tyr Ser Cys Ser Glu Leu Ala Glu Thr Ile Ile Ala Arg Cys
20 25 30

Glu Ala Ala Lys Ser Leu Asn Ala Leu Leu Ala Thr Asp Trp Asp Tyr
35 40 45

Leu Arg Arg Asn Ala Lys Lys Val Asp Glu Asp Gly Ser Ala Gly Glu
50 55 60

Gly Leu Ala Gly Ile Pro Leu Cys Ser Lys Ala Asn Ile Ala Thr Gly
65 70 75 80

Ile Phe Pro Ala Ser Ala Ala Thr Pro Ala Leu Asp Glu His Leu Pro
85 90 95

Thr Thr Pro Ala Gly Val Arg Lys Pro Leu Leu Asp Ala Gly Ala Leu
100 105 110

Ile Gly Ala Ser Gly Asn Met His Glu Leu Ser Phe Gly Ile Thr Ser
115 120 125

Asn Asn His Ala Thr Gly Ala Val Arg Asn Pro Trp Asn Pro Ser Leu
130 135 140

Ile Pro Gly Gly Ser Ser Gly Gly Val Ala Ala Ala Val Ala Ser Arg
145 150 155 160

Leu Met Leu Gly Gly Ile Gly Thr Asp Thr Gly Ala Ser Val Arg Leu
165 170 175

Pro Ala Ser Leu Cys Gly Val Val Gly Phe Arg Pro Thr Ile Gly Arg
180 185 190

Tyr Pro Gly Asp Arg Ile Val Pro Val Ser Pro Thr Arg Asp Thr Ala
195 200 205

Gly Ile Ile Ala Gln Ser Val Pro Asp Val Ile Leu Leu Asp Gln Ile
210 215 220

Ile Cys Gly Lys Leu Thr Thr His Gln Pro Val Pro Leu Glu Gly Leu
225 230 235 240

Arg Ile Gly Leu Pro Thr Thr Tyr Phe Tyr Asp Asp Leu Asp Ala Asp
245 250 255

Val Ala Phe Ala Ala Glu Asn Leu Ile Thr Leu Leu Ala Ser Lys Gly
260 265 270

Val Thr Phe Val Lys Ala Glu Ile Pro Asp Leu Gln Arg Leu Asn Ile
275 280 285

Gly Val Ser Phe Pro Ile Ala Leu Tyr Glu Phe Pro Phe Ala Leu Gln
290 295 300

Lys Tyr Ile Asp Asp Phe Val Lys Asp Val Ser Phe Ser Asp Val Ile
305 310 315 320

Lys Gly Ile Arg Ser Pro Asp Val Ala Asn Ile Ala Asn Ala Gln Ile
325 330 335

Asp Gly His Gln Ile Ser Lys Ala Ser Tyr Glu Leu Ala Arg Gln Ser
340 345 350

Phe Arg Pro Lys Leu Gln Ala Ala Tyr His Asp Tyr Phe Lys Leu His
355 360 365

Gln Leu Asp Ala Ile Leu Phe Pro Thr Ala Pro Leu Thr Ala Lys Pro
370 375 380

Ile Gly Gln Asp Leu Ser Val Met His Asn Gly Val Met Ala Asp Thr
385 390 395 400

Phe Lys Ile Phe Val Arg Asn Val Asp Pro Gly Ser Asn Ala Gly Leu
405 410 415

Pro Gly Leu Ser Leu Pro Val Ser Leu Thr Ser Lys Gly Leu Pro Ile
420 425 430

Gly Met Glu Ile Asp Gly Leu Ala Gly Met Asp Asp Arg Leu Leu Ala
435 440 445

Ile Gly Ala Ala Leu Glu Glu Ala Ile Ala Phe His Asn Leu Pro Asp
450 455 460

Phe Pro Lys Val Glu Thr Asn Tyr
465 470



<210> 37 
<211> 1263 
<212> DNA 

<213> Arabidopsis thaliana

<220> 

<221> CDS 

<222> (1)..(1260) 

<223> coding for 5-methylthioribose kinase 
<400> 37 

atg tct ttt gag gag ttt acg ccg tta aac gag aag tct ctt gta gac 48
Met Ser Phe Glu Glu Phe Thr Pro Leu Asn Glu Lys Ser Leu Val Asp
1 5 10 15

tac atc aag tca aca cct gct ctc tct tcc aag atc gga gcc gac aag 96
Tyr Ile Lys Ser Thr Pro Ala Leu Ser Ser Lys Ile Gly Ala Asp Lys
20 25 30

tcc gat gat gat ttg gtt atc aaa gaa gtt gga gat ggc aat ctc aat 144
Ser Asp Asp Asp Leu Val Ile Lys Glu Val Gly Asp Gly Asn Leu Asn
35 40 45

ttc gtt ttc atc gtt gtt gga tcc tct ggt tct ctt gtc atc aaa cag 192
Phe Val Phe Ile Val Val Gly Ser Ser Gly Ser Leu Val Ile Lys Gln
50 55 60

gct ctt cca tat att cgc tgt atc ggt gaa tca tgg cca atg acg aaa 240
Ala Leu Pro Tyr Ile Arg Cys Ile Gly Glu Ser Trp Pro Met Thr Lys
65 70 75 80

gaa aga gct tat ttt gaa gca aca act ttg aga aag cat gga aat tta 288
Glu Arg Ala Tyr Phe Glu Ala Thr Thr Leu Arg Lys His Gly Asn Leu
85 90 95

tca cct gat cat gtt cct gaa gtc tac cat ttt gac aga aca atg gcg 336
Ser Pro Asp His Val Pro Glu Val Tyr His Phe Asp Arg Thr Met Ala
100 105 110

ttg att gga atg aga tac ctt gag cct cct cat atc att ctc cgc aaa 384
Leu Ile Gly Met Arg Tyr Leu Glu Pro Pro His Ile Ile Leu Arg Lys
115 120 125

gga ctc att gct ggg att gag tat cct ttc ctc gca gac cac atg tct 432
Gly Leu Ile Ala Gly Ile Glu Tyr Pro Phe Leu Ala Asp His Met Ser
130 135 140

gat tac atg gcg aag act ctc ttc ttc act tct ctc ctc tat cac gat 480
Asp Tyr Met Ala Lys Thr Leu Phe Phe Thr Ser Leu Leu Tyr His Asp
145 150 155 160

acc aca gag cac aga aga gca gta acc gaa ttt tgt ggt aat gtg gag 528
Thr Thr Glu His Arg Arg Ala Val Thr Glu Phe Cys Gly Asn Val Glu
165 170 175

tta tgc cga tta acg gag caa gtt gtg ttt tcg gac cca tat aga gtt 576
Leu Cys Arg Leu Thr Glu Gln Val Val Phe Ser Asp Pro Tyr Arg Val
180 185 190

tcc aca ttt aat cgt tgg act tca cct tat ctt gat gat gat gct aag 624
Ser Thr Phe Asn Arg Trp Thr Ser Pro Tyr Leu Asp Asp Asp Ala Lys
195 200 205

gct gtg cgc gaa gac agt gcc ttg aag ctc gaa atc gca gag cta aaa 672
Ala Val Arg Glu Asp Ser Ala Leu Lys Leu Glu Ile Ala Glu Leu Lys
210 215 220

tcg atg ttc tgt gaa aga gct caa gct tta ata cat ggt gat ctt cat 720
Ser Met Phe Cys Glu Arg Ala Gln Ala Leu Ile His Gly Asp Leu His
225 230 235 240

act ggt tct gtc atg gtt act caa gat tca acg caa gtt ata gat cca 768
Thr Gly Ser Val Met Val Thr Gln Asp Ser Thr Gln Val Ile Asp Pro
245 250 255

gag ttt tcg ttc tat gga ccg atg ggt ttc gat att ggc gct tat ctt 816
Glu Phe Ser Phe Tyr Gly Pro Met Gly Phe Asp Ile Gly Ala Tyr Leu
260 265 270

ggt aac ttg ata cta gct ttc ttt gca caa gat gga cac gcc act cag 864
Gly Asn Leu Ile Leu Ala Phe Phe Ala Gln Asp Gly His Ala Thr Gln
275 280 285

gaa aat gat cga aaa gaa tac aag cag tgg atc ttg aga acc att gag 912
Glu Asn Asp Arg Lys Glu Tyr Lys Gln Trp Ile Leu Arg Thr Ile Glu
290 295 300

caa act tgg aat ttg ttt aac aaa agg ttc att gcg cta tgg gat caa 960
Gln Thr Trp Asn Leu Phe Asn Lys Arg Phe Ile Ala Leu Trp Asp Gln
305 310 315 320



aac aaa gat gga cca ggc gaa gca tac ctt gca gat atc tat aac aat 1008
Asn Lys Asp Gly Pro Gly Glu Ala Tyr Leu Ala Asp Ile Tyr Asn Asn
325 330 335

acc gag gtt ttg aag ttt gtt caa gaa aac tac atg agg aat ttg ttg 1056
Thr Glu Val Leu Lys Phe Val Gln Glu Asn Tyr Met Arg Asn Leu Leu
340 345 350

cat gac tca ctc gga ttc ggc gct gca aag atg att agg aga att gtg 1104
His Asp Ser Leu Gly Phe Gly Ala Ala Lys Met Ile Arg Arg Ile Val
355 360 365

gga gtg gca cat gtt gag gac ttt gaa tca atc gaa gaa gat aag cga 1152
Gly Val Ala His Val Glu Asp Phe Glu Ser Ile Glu Glu Asp Lys Arg
370 375 380

aga gct att tgc gag aga agt gca ctc gag ttt gcg aag atg ctt ctc 1200
Arg Ala Ile Cys Glu Arg Ser Ala Leu Glu Phe Ala Lys Met Leu Leu
385 390 395 400

aag gaa agg aga aag ttt aag agt atc ggt gaa gtt gtt tca gca att 1248
Lys Glu Arg Arg Lys Phe Lys Ser Ile Gly Glu Val Val Ser Ala Ile
405 410 415

caa caa caa agc taa 1263
Gln Gln Gln Ser
420

<210> 38
<211> 420
<212> PRT

<213> Arabidopsis thaliana
<400> 38

Met Ser Phe Glu Glu Phe Thr Pro Leu Asn Glu Lys Ser Leu Val Asp
1 5 10 15

Tyr Ile Lys Ser Thr Pro Ala Leu Ser Ser Lys Ile Gly Ala Asp Lys
20 25 30

Ser Asp Asp Asp Leu Val Ile Lys Glu Val Gly Asp Gly Asn Leu Asn
35 40 45

Phe Val Phe Ile Val Val Gly Ser Ser Gly Ser Leu Val Ile Lys Gln
50 55 60

Ala Leu Pro Tyr Ile Arg Cys Ile Gly Glu Ser Trp Pro Met Thr Lys
65 70 75 80

Glu Arg Ala Tyr Phe Glu Ala Thr Thr Leu Arg Lys His Gly Asn Leu
85 90 95

Ser Pro Asp His Val Pro Glu Val Tyr His Phe Asp Arg Thr Met Ala
100 105 110

Leu Ile Gly Met Arg Tyr Leu Glu Pro Pro His Ile Ile Leu Arg Lys
115 120 125

Gly Leu Ile Ala Gly Ile Glu Tyr Pro Phe Leu Ala Asp His Met Ser
130 135 140

Asp Tyr Met Ala Lys Thr Leu Phe Phe Thr Ser Leu Leu Tyr His Asp
145 150 155 160

Thr Thr Glu His Arg Arg Ala Val Thr Glu Phe Cys Gly Asn Val Glu
165 170 175

Leu Cys Arg Leu Thr Glu Gln Val Val Phe Ser Asp Pro Tyr Arg Val
180 185 190

Ser Thr Phe Asn Arg Trp Thr Ser Pro Tyr Leu Asp Asp Asp Ala Lys
195 200 205

Ala Val Arg Glu Asp Ser Ala Leu Lys Leu Glu Ile Ala Glu Leu Lys
210 215 220

Ser Met Phe Cys Glu Arg Ala Gln Ala Leu Ile His Gly Asp Leu His
225 230 235 240

Thr Gly Ser Val Met Val Thr Gln Asp Ser Thr Gln Val Ile Asp Pro
245 250 255

Glu Phe Ser Phe Tyr Gly Pro Met Gly Phe Asp Ile Gly Ala Tyr Leu
260 265 270

Gly Asn Leu Ile Leu Ala Phe Phe Ala Gln Asp Gly His Ala Thr Gln
275 280 285

Glu Asn Asp Arg Lys Glu Tyr Lys Gln Trp Ile Leu Arg Thr Ile Glu
290 295 300

Gln Thr Trp Asn Leu Phe Asn Lys Arg Phe Ile Ala Leu Trp Asp Gln
305 310 315 320

Asn Lys Asp Gly Pro Gly Glu Ala Tyr Leu Ala Asp Ile Tyr Asn Asn
325 330 335

Thr Glu Val Leu Lys Phe Val Gln Glu Asn Tyr Met Arg Asn Leu Leu
340 345 350

His Asp Ser Leu Gly Phe Gly Ala Ala Lys Met Ile Arg Arg Ile Val
355 360 365

Gly Val Ala His Val Glu Asp Phe Glu Ser Ile Glu Glu Asp Lys Arg
370 375 380

Arg Ala Ile Cys Glu Arg Ser Ala Leu Glu Phe Ala Lys Met Leu Leu
385 390 395 400

Lys Glu Arg Arg Lys Phe Lys Ser Ile Gly Glu Val Val Ser Ala Ile
405 410 415

Gln Gln Gln Ser
420

<210> 39 
<211> 1200
<212> DNA 

<213> Klebsiella pneumoniae 

<220> 

<221> CDS 

<222> (1)..(1197) 

<223> coding for 5-methylthioribose kinase 
<400> 39 

atg tcg caa tac cat acc ttc acc gcc cac gat gcc gtg gct tac gcg 48
Met Ser Gln Tyr His Thr Phe Thr Ala His Asp Ala Val Ala Tyr Ala
1 5 10 15

caa cag ttc gcc ggc atc gac aac cca tct gag ctg gtc agc gcg cag 96
Gln Gln Phe Ala Gly Ile Asp Asn Pro Ser Glu Leu Val Ser Ala Gln
20 25 30

gaa gtg ggc gat ggc aac ctc aat ctg gtg ttt aaa gtg ttc gat cgt 144
Glu Val Gly Asp Gly Asn Leu Asn Leu Val Phe Lys Val Phe Asp Arg
35 40 45

cag ggc gtc agc cgg gcg atc gtc aaa cag gcc ctg ccc tac gtg cgc 192
Gln Gly Val Ser Arg Ala Ile Val Lys Gln Ala Leu Pro Tyr Val Arg
50 55 60

tgc gtc ggc gaa tcc tgg ccg ctg acc ctc gac cgc gcc cgt ctc gaa 240
Cys Val Gly Glu Ser Trp Pro Leu Thr Leu Asp Arg Ala Arg Leu Glu
65 70 75 80

gcg cag acc ctg gtc gcc cac tat cag cac agc ccg cag cac acg gta 288
Ala Gln Thr Leu Val Ala His Tyr Gln His Ser Pro Gln His Thr Val
85 90 95

aaa atc cat cac ttt gat ccc gag ctg gcg gtg atg gtg atg gaa gat 336
Lys Ile His His Phe Asp Pro Glu Leu Ala Val Met Val Met Glu Asp
100 105 110

ctt tcc gac cac cgc atc tgg cgc gga gag ctt atc gct aac gtc tac 384
Leu Ser Asp His Arg Ile Trp Arg Gly Glu Leu Ile Ala Asn Val Tyr
115 120 125

tat ccc cag gcg gcc cgc cag ctt ggc gac tat ctg gcg cag gtg ttg 432
Tyr Pro Gln Ala Ala Arg Gln Leu Gly Asp Tyr Leu Ala Gln Val Leu
130 135 140

ttc cac acc agc gat ttc tac ctc cat ccc cac gag aaa aag gcg cag 480
Phe His Thr Ser Asp Phe Tyr Leu His Pro His Glu Lys Lys Ala Gln
145 150 155 160

gtg gcg cag ttt att aac ccg gcg atg tgc gag atc acc gag gat ctg 528
Val Ala Gln Phe Ile Asn Pro Ala Met Cys Glu Ile Thr Glu Asp Leu
165 170 175

ttc ttt aac gac ccg tat cag atc cac gag cgc aat aac tac ccg gcg 576
Phe Phe Asn Asp Pro Tyr Gln Ile His Glu Arg Asn Asn Tyr Pro Ala
180 185 190

gag ctg gag gcc gat gtc gcc gcc ctg cgc gac gac gcc cag ctt aag 624
Glu Leu Glu Ala Asp Val Ala Ala Leu Arg Asp Asp Ala Gln Leu Lys
195 200 205

ctg gcg gtg gcg gcg ctg aag cac cgt ttc ttt gcc cat gcg gaa gcg 672
Leu Ala Val Ala Ala Leu Lys His Arg Phe Phe Ala His Ala Glu Ala
210 215 220

ctg ctg cac ggc gat atc cac agc ggg tcg atc ttc gtt gcc gaa ggt 720
Leu Leu His Gly Asp Ile His Ser Gly Ser Ile Phe Val Ala Glu Gly
225 230 235 240

agc ctg aag gcc atc gac gcc gag ttc ggc tac ttc ggc ccc atc ggc 768
Ser Leu Lys Ala Ile Asp Ala Glu Phe Gly Tyr Phe Gly Pro Ile Gly
245 250 255

ttc gat atc ggc acc gcc atc ggc aac ctg ctg ctg aac tac tgc ggc 816
Phe Asp Ile Gly Thr Ala Ile Gly Asn Leu Leu Leu Asn Tyr Cys Gly
260 265 270

ctg ccg ggc cag ctc ggc att cgc gat gcc gcc gcc gcg cgc gag cag 864
Leu Pro Gly Gln Leu Gly Ile Arg Asp Ala Ala Ala Ala Arg Glu Gln
275 280 285

cgg ctg aac gac atc cac cag ctg tgg acc acc ttc gcc gag cgc ttc 912
Arg Leu Asn Asp Ile His Gln Leu Trp Thr Thr Phe Ala Glu Arg Phe
290 295 300

cag gcg ctg gcg gcg gag aaa acc cgc gac gcg gcg ctg gct tac ccc 960
Gln Ala Leu Ala Ala Glu Lys Thr Arg Asp Ala Ala Leu Ala Tyr Pro
305 310 315 320

ggc tac gcc tcc gcc ttt ctg aag aaa gtc tgg gcg gac gcg gtc ggc 1008
Gly Tyr Ala Ser Ala Phe Leu Lys Lys Val Trp Ala Asp Ala Val Gly
325 330 335

ttc tgc ggc agc gaa ctg atc cgc cgc agc gtc gga ctg tcg cac gtc 1056
Phe Cys Gly Ser Glu Leu Ile Arg Arg Ser Val Gly Leu Ser His Val
340 345 350

gcg gat atc gac act atc cag gac gac gcc atg cgt cat gag tgc ctg 1104
Ala Asp Ile Asp Thr Ile Gln Asp Asp Ala Met Arg His Glu Cys Leu
355 360 365

cgc cac gcc att acc ctg ggc aga gcg ctg atc gtg ctg gcc gag cgt 1152
Arg His Ala Ile Thr Leu Gly Arg Ala Leu Ile Val Leu Ala Glu Arg
370 375 380

atc gac agc gtc gac gag ctg ctg gcg cgg gta cgc cag tac agc tga 1200
Ile Asp Ser Val Asp Glu Leu Leu Ala Arg Val Arg Gln Tyr Ser
385 390 395

<210> 40 
<211> 399 
<212> PRT 

<213> Klebsiella pneumoniae 
<400> 40 

Met Ser Gln Tyr His Thr Phe Thr Ala His Asp Ala Val Ala Tyr Ala
1 5 10 15

Gln Gln Phe Ala Gly Ile Asp Asn Pro Ser Glu Leu Val Ser Ala Gln
20 25 30

Glu Val Gly Asp Gly Asn Leu Asn Leu Val Phe Lys Val Phe Asp Arg
35 40 45

Gln Gly Val Ser Arg Ala Ile Val Lys Gln Ala Leu Pro Tyr Val Arg
50 55 60

Cys Val Gly Glu Ser Trp Pro Leu Thr Leu Asp Arg Ala Arg Leu Glu
65 70 75 80

Ala Gln Thr Leu Val Ala His Tyr Gln His Ser Pro Gln His Thr Val
85 90 95

Lys Ile His His Phe Asp Pro Glu Leu Ala Val Met Val Met Glu Asp
100 105 110

Leu Ser Asp His Arg Ile Trp Arg Gly Glu Leu Ile Ala Asn Val Tyr
115 120 125

Tyr Pro Gln Ala Ala Arg Gln Leu Gly Asp Tyr Leu Ala Gln Val Leu
130 135 140

Phe His Thr Ser Asp Phe Tyr Leu His Pro His Glu Lys Lys Ala Gln
145 150 155 160

Val Ala Gln Phe Ile Asn Pro Ala Met Cys Glu Ile Thr Glu Asp Leu
165 170 175

Phe Phe Asn Asp Pro Tyr Gln Ile His Glu Arg Asn Asn Tyr Pro Ala
180 185 190

Glu Leu Glu Ala Asp Val Ala Ala Leu Arg Asp Asp Ala Gln Leu Lys
195 200 205

Leu Ala Val Ala Ala Leu Lys His Arg Phe Phe Ala His Ala Glu Ala
210 215 220

Leu Leu His Gly Asp Ile His Ser Gly Ser Ile Phe Val Ala Glu Gly
225 230 235 240

Ser Leu Lys Ala Ile Asp Ala Glu Phe Gly Tyr Phe Gly Pro Ile Gly
245 250 255

Phe Asp Ile Gly Thr Ala Ile Gly Asn Leu Leu Leu Asn Tyr Cys Gly
260 265 270

Leu Pro Gly Gln Leu Gly Ile Arg Asp Ala Ala Ala Ala Arg Glu Gln
275 280 285

Arg Leu Asn Asp Ile His Gln Leu Trp Thr Thr Phe Ala Glu Arg Phe
290 295 300

Gln Ala Leu Ala Ala Glu Lys Thr Arg Asp Ala Ala Leu Ala Tyr Pro
305 310 315 320

Gly Tyr Ala Ser Ala Phe Leu Lys Lys Val Trp Ala Asp Ala Val Gly
325 330 335

Phe Cys Gly Ser Glu Leu Ile Arg Arg Ser Val Gly Leu Ser His Val
340 345 350

Ala Asp Ile Asp Thr Ile Gln Asp Asp Ala Met Arg His Glu Cys Leu
355 360 365

Arg His Ala Ile Thr Leu Gly Arg Ala Leu Ile Val Leu Ala Glu Arg
370 375 380

Ile Asp Ser Val Asp Glu Leu Leu Ala Arg Val Arg Gln Tyr Ser
385 390 395

<210> 41
<211> 1140
<212> DNA
<213> Arabidopsis thaliana

<220>
<221> CDS
<222> (1)..(1137)
<223> coding for alcohol dehydrogenase

<400> 41

atg tct acc acc gga cag att att cga tgc aaa gct gct gtg gca tgg 48
Met Ser Thr Thr Gly Gln Ile Ile Arg Cys Lys Ala Ala Val Ala Trp
1 5 10 15

gaa gcc gga aag cca ctg gtg atc gag gaa gtg gag gtt gct cca ccg 96
Glu Ala Gly Lys Pro Leu Val Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

cag aaa cac gaa gtt cgt atc aag att ctc ttc act tct ctc tgt cac 144
Gln Lys His Glu Val Arg Ile Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

acc gat gtt tac ttc tgg gaa gct aag gga caa aca ccg ttg ttt cca 192
Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Leu Phe Pro
50 55 60

cgt atc ttc ggc cat gaa gct gga ggg att gtt gag agt gtt gga gaa 240
Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

gga gtg act gat ctt cag cca gga gat cat gtg ttg ccg atc ttt acc 288
Gly Val Thr Asp Leu Gln Pro Gly Asp His Val Leu Pro Ile Phe Thr
85 90 95

gga gaa tgt gga gat tgt cgt cat tgc cag tcg gag gaa tca aac atg 336
Gly Glu Cys Gly Asp Cys Arg His Cys Gln Ser Glu Glu Ser Asn Met
100 105 110

tgt gat ctt ctc agg atc aac aca gag cga gga ggt atg att cac gat 384
Cys Asp Leu Leu Arg Ile Asn Thr Glu Arg Gly Gly Met Ile His Asp
115 120 125

ggt gaa tct aga ttc tcc att aat ggc aaa cca atc tac cat ttc ctt 432
Gly Glu Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Leu
130 135 140

ggg acg tcc acg ttc agt gag tac act gtg gtt cac tct ggt cag gtc 480
Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Val His Ser Gly Gln Val
145 150 155 160

gct aag atc aat ccg gat gct cct ctt gac aag gtc tgt att gtc agt 528
Ala Lys Ile Asn Pro Asp Ala Pro Leu Asp Lys Val Cys Ile Val Ser
165 170 175

tgt ggt ttg tct act ggg tta gga gca act ttg aat gtg gct aaa ccc 576
Cys Gly Leu Ser Thr Gly Leu Gly Ala Thr Leu Asn Val Ala Lys Pro
180 185 190

aag aaa ggt caa agt gtt gcc att ttt ggt ctt ggt gct gtt ggt tta 624
Lys Lys Gly Gln Ser Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

ggc gct gca gaa ggt gct aga atc gct ggt gct tct agg atc atc ggt 672
Gly Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

gtt gat ttt aac tct aaa aga ttc gac caa gct aag gaa ttc ggt gtg 720
Val Asp Phe Asn Ser Lys Arg Phe Asp Gln Ala Lys Glu Phe Gly Val
225 230 235 240

acc gag tgt gtg aac ccg aaa gac cat gac aag cca att caa cag gtg 768
Thr Glu Cys Val Asn Pro Lys Asp His Asp Lys Pro Ile Gln Gln Val
245 250 255

atc gct gag atg acg gat ggt ggg gtg gac agg agt gtg gaa tgc acc 816
Ile Ala Glu Met Thr Asp Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

gga agc gtt cag gcc atg att caa gca ttt gaa tgt gtc cac gat ggc 864
Gly Ser Val Gln Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

tgg ggt gtt gca gtg ctg gtg ggt gtg cca agc aaa gac gat gcc ttc 912
Trp Gly Val Ala Val Leu Val Gly Val Pro Ser Lys Asp Asp Ala Phe
290 295 300

aag act cat ccg atg aat ttc ttg aat gag agg act ctt aag ggt act 960
Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

ttc ttc ggg aac tac aaa ccc aaa act gac att ccc ggg gtt gtg gaa 1008
Phe Phe Gly Asn Tyr Lys Pro Lys Thr Asp Ile Pro Gly Val Val Glu
325 330 335

aag tac atg aac aag gag ctg gag ctt gag aaa ttc atc act cac aca 1056
Lys Tyr Met Asn Lys Glu Leu Glu Leu Glu Lys Phe Ile Thr His Thr
340 345 350

gtg cca ttc tcg gaa atc aac aag gcc ttt gat tac atg ctg aag gga 1104
Val Pro Phe Ser Glu Ile Asn Lys Ala Phe Asp Tyr Met Leu Lys Gly
355 360 365

gag agt att cgt tgc atc atc acc atg ggt gct tga 1140
Glu Ser Ile Arg Cys Ile Ile Thr Met Gly Ala
370 375

<210> 42
<211> 379
<212> PRT
<213> Arabidopsis thaliana

<400> 42

Met Ser Thr Thr Gly Gln Ile Ile Arg Cys Lys Ala Ala Val Ala Trp
1 5 10 15

Glu Ala Gly Lys Pro Leu Val Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

Gln Lys His Glu Val Arg Ile Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Leu Phe Pro
50 55 60

Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

Gly Val Thr Asp Leu Gln Pro Gly Asp His Val Leu Pro Ile Phe Thr
85 90 95

Gly Glu Cys Gly Asp Cys Arg His Cys Gln Ser Glu Glu Ser Asn Met
100 105 110

Cys Asp Leu Leu Arg Ile Asn Thr Glu Arg Gly Gly Met Ile His Asp
115 120 125

Gly Glu Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Leu
130 135 140

Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Val His Ser Gly Gln Val
145 150 155 160

Ala Lys Ile Asn Pro Asp Ala Pro Leu Asp Lys Val Cys Ile Val Ser
165 170 175

Cys Gly Leu Ser Thr Gly Leu Gly Ala Thr Leu Asn Val Ala Lys Pro
180 185 190

Lys Lys Gly Gln Ser Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

Gly Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

Val Asp Phe Asn Ser Lys Arg Phe Asp Gln Ala Lys Glu Phe Gly Val
225 230 235 240

Thr Glu Cys Val Asn Pro Lys Asp His Asp Lys Pro Ile Gln Gln Val
245 250 255

Ile Ala Glu Met Thr Asp Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

Gly Ser Val Gln Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

Trp Gly Val Ala Val Leu Val Gly Val Pro Ser Lys Asp Asp Ala Phe
290 295 300

Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

Phe Phe Gly Asn Tyr Lys Pro Lys Thr Asp Ile Pro Gly Val Val Glu
325 330 335

Lys Tyr Met Asn Lys Glu Leu Glu Leu Glu Lys Phe Ile Thr His Thr
340 345 350

Val Pro Phe Ser Glu Ile Asn Lys Ala Phe Asp Tyr Met Leu Lys Gly
355 360 365

Glu Ser Ile Arg Cys Ile Ile Thr Met Gly Ala
370 375



<210> 43 


<211> 1140 
<212> DNA 

<213> Hordeum vulgare 

<220> 

<221> CDS 

<222> (1)..(1137) 

<223> coding for alcohol dehydrogenase 
<400> 43 

atg gcg acg gcc ggc aag gtg atc aag tgc aaa gcc gcg gtg gcg tgg 48
Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

gag gcc ggg aag ccg ctg acc atg gag gag gtg gag gtg gcg ccg ccg 96
Glu Ala Gly Lys Pro Leu Thr Met Glu Glu Val Glu Val Ala Pro Pro
20 25 30

cag gcc atg gag gtg cgc gtc aag atc ctc ttc acc tcc ctc tgc cac 144
Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

acc gac gtc tac ttc tgg gag gcc aag ggg cag acc ccc atg ttc cct 192
Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Met Phe Pro
50 55 60

cgg atc ttc ggc cat gaa gct gga ggc ata gtg gag agt gtt gga gag 240
Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

ggc gtg act gat gtt gcc cct ggt gac cac gtc ctc cct gtg ttc act 288
Gly Val Thr Asp Val Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

ggg gag tgt aag gaa tgc cca cat tgc aag tct gcg gag agc aac atg 336
Gly Glu Cys Lys Glu Cys Pro His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

tgt gat ctg ctc agg atc aac acc gac aga ggt gtg atg atc ggg gat 384
Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

ggc aag tcg cgc ttc tct att ggc ggc aag ccg att tac cat ttc gta 432
Gly Lys Ser Arg Phe Ser Ile Gly Gly Lys Pro Ile Tyr His Phe Val
130 135 140

ggg act tcc acc ttc agt gag tac act gtc atg cat gtc ggt tgt gtt 480
Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

gcc aag atc aac cct gag gct ccc ctt gat aaa gtc tgt gtt ctt agc 528
Ala Lys Ile Asn Pro Glu Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

tgt ggt att tgc act ggt ctt ggc gcg tca att aat gtt gca aaa cca 576
Cys Gly Ile Cys Thr Gly Leu Gly Ala Ser Ile Asn Val Ala Lys Pro
180 185 190

cca aag ggt tcc aca gtg gcg ata ttt ggg cta gga gct gtt ggc ctt 624
Pro Lys Gly Ser Thr Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

gct gct gca gaa ggt gca agg att gca ggt gca tca agg atc att ggt 672
Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

gtt gac ctg aac gcc agc aga ttt gaa gag gct agg aag ttt ggc tgc 720
Val Asp Leu Asn Ala Ser Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

acg gaa ttt gtg aac ccg aaa gat cac acc aag cca gtt cag cag gtg 768
Thr Glu Phe Val Asn Pro Lys Asp His Thr Lys Pro Val Gln Gln Val
245 250 255

ctc gct gac atg aca aat ggc gga gtt gac cgc agt gtt gag tgc act 816
Leu Ala Asp Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

ggc aac gtc aat gct atg ata caa gca ttt gaa tgt gtt cat gat ggc 864
Gly Asn Val Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

tgg ggt gta gct gtg ctg gtg ggt gtg cca cac aag gac gct gaa ttc 912
Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

aag acc cac ccg atg aac ttc ctg aat gag agg acc ctg aag ggc acc 960
Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

ttc ttc ggt aac ttc aag ccg cgc act gac ctg ccc aat gtc gtg gag 1008
Phe Phe Gly Asn Phe Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

atg tac atg aag aag gag ctg gag gtg gag aag ttc atc aca cac agc 1056
Met Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

gtg ccg ttc tcg gag ata aac aag gcc ttc gac ctt atg gcg aag ggg 1104
Val Pro Phe Ser Glu Ile Asn Lys Ala Phe Asp Leu Met Ala Lys Gly
355 360 365

gag ggc atc cgt tgc atc atc cgc atg gac aac tag 1140
Glu Gly Ile Arg Cys Ile Ile Arg Met Asp Asn
370 375

<210> 44 
<211> 379 
<212> PRT 

<213> Hordeum vulgare 
<400> 44 

Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

Glu Ala Gly Lys Pro Leu Thr Met Glu Glu Val Glu Val Ala Pro Pro
20 25 30

Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Met Phe Pro
50 55 60

Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

Gly Val Thr Asp Val Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

Gly Glu Cys Lys Glu Cys Pro His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

Gly Lys Ser Arg Phe Ser Ile Gly Gly Lys Pro Ile Tyr His Phe Val
130 135 140

Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

Ala Lys Ile Asn Pro Glu Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

Cys Gly Ile Cys Thr Gly Leu Gly Ala Ser Ile Asn Val Ala Lys Pro
180 185 190

Pro Lys Gly Ser Thr Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

Val Asp Leu Asn Ala Ser Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

Thr Glu Phe Val Asn Pro Lys Asp His Thr Lys Pro Val Gln Gln Val
245 250 255

Leu Ala Asp Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

Gly Asn Val Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

Phe Phe Gly Asn Phe Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

Met Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

Val Pro Phe Ser Glu Ile Asn Lys Ala Phe Asp Leu Met Ala Lys Gly
355 360 365

Glu Gly Ile Arg Cys Ile Ile Arg Met Asp Asn
370 375

<210> 45 
<211> 1140 
<212> DNA 

<213> Oryza sativa 

<220>
<221> CDS
<222> (1)..(1137)
<223> coding for alcohol dehydrogenase

<400> 45

atg gcg acc gca ggg aag gtg atc aag tgc aaa gcg gcg gtg gca tgg 48
Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

gag gcc gcg aag ccg ctg gtg atc gag gag gtg gag gtg gcg ccg ccg 96
Glu Ala Ala Lys Pro Leu Val Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

cag gcc atg gag gtg cgc gtc aag atc ctc ttc acc tcg ctc tgc cac 144
Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

acc gac gtc tac ttc tgg gag gcc aag gga cag act ccc gtg ttc cct 192
Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Val Phe Pro
50 55 60

cgg atc ttc ggc cat gaa gct gga ggt att gtg gag agt gtt gga gag 240
Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

ggt gtg act gat ctt gcc cct ggt gac cat gtt ctc cct gtg ttc act 288
Gly Val Thr Asp Leu Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

ggg gag tgc aag gag tgt gcc cac tgc aag tca gca gag agc aac atg 336
Gly Glu Cys Lys Glu Cys Ala His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

tgt gat ctg ctc agg atc aac act gac agg ggt gtg atg att ggt gat 384
Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

ggc aaa tca cgc ttt tcc atc aac ggg aag ccc att tac cat ttc gtc 432
Gly Lys Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Val
130 135 140

ggg act tcg acc ttc agc gag tac act gtc atg cat gtt ggt tgc gtt 480
Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

gcg aag atc aac ccg gca gct cca ctt gat aaa gtt tgc gtt ctt agc 528
Ala Lys Ile Asn Pro Ala Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

tgt ggt att tct act ggt ctt ggt gct aca atc aat gtg gca aag cca 576
Cys Gly Ile Ser Thr Gly Leu Gly Ala Thr Ile Asn Val Ala Lys Pro
180 185 190

cca aag ggt tcg acg gtg gcg ata ttt ggt cta gga gct gta ggc ctt 624
Pro Lys Gly Ser Thr Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

gct gcc gca gaa ggt gca agg att gca gga gcg tca agg atc att ggc 672
Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

att gac ctg aac gcc aac aga ttt gaa gaa gct agg aaa ttt ggt tgc 720
Ile Asp Leu Asn Ala Asn Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

act gaa ttt gtg aac cca aag gac cat gac aag cca gtt cag cag gta 768
Thr Glu Phe Val Asn Pro Lys Asp His Asp Lys Pro Val Gln Gln Val
245 250 255

ctt gct gag atg acc aat ggc gga gtt gac cgc agc gtt gaa tgc act 816
Leu Ala Glu Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

ggc aac atc aac gcc atg atc caa gca ttt gaa tgt gtt cat gat ggc 864
Gly Asn Ile Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

tgg ggt gtt gct gtt ttg gtc ggc gtg cca cac aag gac gcc gag ttc 912
Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

aag acc cac ccg atg aac ttc ctg aac gag agg act ctc aag gga acc 960
Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

ttc ttc ggc aac tac aag cca cgc acc gat ctg ccc aac gtc gtc gag 1008
Phe Phe Gly Asn Tyr Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

ctc tac atg aag aag gag ctg gag gtg gag aag ttc atc aca cac agc 1056
Leu Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

gtg ccg ttc tcg gag atc aac acg gcg ttc gac ctg atg cac aag ggc 1104
Val Pro Phe Ser Glu Ile Asn Thr Ala Phe Asp Leu Met His Lys Gly
355 360 365

gag ggc atc cgc tgc atc atc cgc atg gag aac tga 1140
Glu Gly Ile Arg Cys Ile Ile Arg Met Glu Asn
370 375

<210> 46
<211> 379
<212> PRT
<213> Oryza sativa

<400> 46

Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

Glu Ala Ala Lys Pro Leu Val Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Val Phe Pro
50 55 60

Arg Ile Phe Gly His Glu Ala Gly Gly Ile Val Glu Ser Val Gly Glu
65 70 75 80

Gly Val Thr Asp Leu Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

Gly Glu Cys Lys Glu Cys Ala His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

Gly Lys Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Val
130 135 140

Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

Ala Lys Ile Asn Pro Ala Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

Cys Gly Ile Ser Thr Gly Leu Gly Ala Thr Ile Asn Val Ala Lys Pro
180 185 190

Pro Lys Gly Ser Thr Val Ala Ile Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

Ile Asp Leu Asn Ala Asn Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

Thr Glu Phe Val Asn Pro Lys Asp His Asp Lys Pro Val Gln Gln Val
245 250 255

Leu Ala Glu Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

Gly Asn Ile Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

Phe Phe Gly Asn Tyr Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

Leu Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

Val Pro Phe Ser Glu Ile Asn Thr Ala Phe Asp Leu Met His Lys Gly
355 360 365

Glu Gly Ile Arg Cys Ile Ile Arg Met Glu Asn
370 375



<210> 47 

<211> 1140 

<212> DNA 

<213> Zea mays

<220>
<221> CDS
<222> (1)..(1137)
<223> coding for alcohol dehydrogenase

<400> 47

atg gcg acc gcg ggg aag gtg atc aag tgc aaa gct gcg gtg gca tgg 48
Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

gag gcc ggc aag cca ctg tcg atc gag gag gtg gag gta gcg cct ccg 96
Glu Ala Gly Lys Pro Leu Ser Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

cag gcc atg gag gtg cgc gtc aag atc ctc ttc acc tcg ctc tgc cac 144
Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

acc gac gtc tac ttc tgg gag gcc aag ggg cag act ccc gtg ttc cct 192
Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Val Phe Pro
50 55 60

cgg atc ttt ggc cat gag gct gga ggt atc ata gag agt gtt gga gag 240
Arg Ile Phe Gly His Glu Ala Gly Gly Ile Ile Glu Ser Val Gly Glu
65 70 75 80

ggt gtg act gac gta gct ccg ggc gac cat gtc ctt cct gtg ttc act 288
Gly Val Thr Asp Val Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

ggg gag tgc aag gag tgc gcc cac tgc aag tcg gca gag agc aac atg 336
Gly Glu Cys Lys Glu Cys Ala His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

tgt gat ttg ctc agg atc aac act gac cgc ggt gtg atg att ggc gat 384
Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

ggc aag tcg cgg ttt tca atc aat ggg aag cct atc tac cac ttt gtt 432
Gly Lys Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Val
130 135 140

ggg act tcc acc ttc agc gag tac acc gtc atg cat gtc ggt tgt gtt 480
Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

gca aag atc aac cct cag gct ccc ctt gat aaa gtt tgc gtc ctt agc 528
Ala Lys Ile Asn Pro Gln Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

tgt ggt att tct act ggt ctt ggt gca tca att aat gtt gca aaa cct 576
Cys Gly Ile Ser Thr Gly Leu Gly Ala Ser Ile Asn Val Ala Lys Pro
180 185 190

ccg aag ggt tcg aca gtg gct gtt ttc ggt tta gga gcc gtt ggt ctt 624
Pro Lys Gly Ser Thr Val Ala Val Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

gcc gct gca gaa ggt gca agg att gct gga gcg tca agg atc att ggt 672
Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

gtc gac ctg aac ccc agc aga ttc gaa gaa gct agg aag ttc ggt tgc 720
Val Asp Leu Asn Pro Ser Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

act gaa ttt gtg aac cca aaa gac cac aac aag ccg gtg cag gag gta 768
Thr Glu Phe Val Asn Pro Lys Asp His Asn Lys Pro Val Gln Glu Val
245 250 255

ctt gct gag atg acc aac gga ggg gtc gac cgc agc gtg gaa tgc act 816
Leu Ala Glu Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

ggc aac atc aat gct atg atc caa gct ttc gaa tgt gtt cat gat ggc 864
Gly Asn Ile Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

tgg ggt gtt gcc gtg ctg gtg ggt gtg ccg cat aag gac gct gag ttc 912
Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

aag acc cac ccg atg aac ttc ctg aac gaa agg acc ctg aag ggg acc 960
Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

ttc ttt ggc aac tat aag cca cgc act gat ctg cca aat gtg gtg gag 1008
Phe Phe Gly Asn Tyr Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

ctg tac atg aaa aag gag ctg gag gtg gag aag ttc atc acg cac agc 1056
Leu Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

gtc ccg ttc gcg gag atc aac aag gcg ttc aac ctg atg gcc aag ggg 1104
Val Pro Phe Ala Glu Ile Asn Lys Ala Phe Asn Leu Met Ala Lys Gly
355 360 365

gag ggc atc cgc tgc atc atc cgc atg gag aac tag 1140
Glu Gly Ile Arg Cys Ile Ile Arg Met Glu Asn
370 375

<210> 48
<211> 379
<212> PRT
<213> Zea mays

<400> 48

Met Ala Thr Ala Gly Lys Val Ile Lys Cys Lys Ala Ala Val Ala Trp
1 5 10 15

Glu Ala Gly Lys Pro Leu Ser Ile Glu Glu Val Glu Val Ala Pro Pro
20 25 30

Gln Ala Met Glu Val Arg Val Lys Ile Leu Phe Thr Ser Leu Cys His
35 40 45

Thr Asp Val Tyr Phe Trp Glu Ala Lys Gly Gln Thr Pro Val Phe Pro
50 55 60

Arg Ile Phe Gly His Glu Ala Gly Gly Ile Ile Glu Ser Val Gly Glu
65 70 75 80

Gly Val Thr Asp Val Ala Pro Gly Asp His Val Leu Pro Val Phe Thr
85 90 95

Gly Glu Cys Lys Glu Cys Ala His Cys Lys Ser Ala Glu Ser Asn Met
100 105 110

Cys Asp Leu Leu Arg Ile Asn Thr Asp Arg Gly Val Met Ile Gly Asp
115 120 125

Gly Lys Ser Arg Phe Ser Ile Asn Gly Lys Pro Ile Tyr His Phe Val
130 135 140

Gly Thr Ser Thr Phe Ser Glu Tyr Thr Val Met His Val Gly Cys Val
145 150 155 160

Ala Lys Ile Asn Pro Gln Ala Pro Leu Asp Lys Val Cys Val Leu Ser
165 170 175

Cys Gly Ile Ser Thr Gly Leu Gly Ala Ser Ile Asn Val Ala Lys Pro
180 185 190

Pro Lys Gly Ser Thr Val Ala Val Phe Gly Leu Gly Ala Val Gly Leu
195 200 205

Ala Ala Ala Glu Gly Ala Arg Ile Ala Gly Ala Ser Arg Ile Ile Gly
210 215 220

Val Asp Leu Asn Pro Ser Arg Phe Glu Glu Ala Arg Lys Phe Gly Cys
225 230 235 240

Thr Glu Phe Val Asn Pro Lys Asp His Asn Lys Pro Val Gln Glu Val
245 250 255

Leu Ala Glu Met Thr Asn Gly Gly Val Asp Arg Ser Val Glu Cys Thr
260 265 270

Gly Asn Ile Asn Ala Met Ile Gln Ala Phe Glu Cys Val His Asp Gly
275 280 285

Trp Gly Val Ala Val Leu Val Gly Val Pro His Lys Asp Ala Glu Phe
290 295 300

Lys Thr His Pro Met Asn Phe Leu Asn Glu Arg Thr Leu Lys Gly Thr
305 310 315 320

Phe Phe Gly Asn Tyr Lys Pro Arg Thr Asp Leu Pro Asn Val Val Glu
325 330 335

Leu Tyr Met Lys Lys Glu Leu Glu Val Glu Lys Phe Ile Thr His Ser
340 345 350

Val Pro Phe Ala Glu Ile Asn Lys Ala Phe Asn Leu Met Ala Lys Gly
355 360 365

Glu Gly Ile Arg Cys Ile Ile Arg Met Glu Asn
370 375

<210> 49 

<211> 505 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: coding for
sense RNA-fragment of E. coli codA gene

<400> 49 

aagcttggct aacagtgtcg aataacgctt tacaaacaat tattaacgcc cggttaccag 60 
gcgaagaggg gctgtggcag attcatctgc aggacggaaa aatcagcgcc attgatgcgc 120 

aatccggcgt gatgcccata actgaaaaca gcctggatgc cgaacaaggt ttagttatac 180 

cgccgtttgt ggagccacat attcacctgg acaccacgca aaccgccgga caaccgaact 240 

ggaatcagtc cggcacgctg tttgaaggca ttgaacgctg ggccgagcgc aaagcgttat 300 

taacccatga cgatgtgaaa caacgcgcat ggcaaacgct gaaatggcag attgccaacg 360 

gcattcagca tgtgcgtacc catgtcgatg tttcggatgc aacgctaact gcgctgaaag 420 

caatgctgga agtgaagcag gaagtcgcgc cgtggattga tctgcaaatc gtcgccttcc 480 

ctcaggaagg gattttgtcg tcgac 505 

<210> 50 
<211> 27 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 50 

cgtgaatacg gcgtggagtc g 21 

<210> 51 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 51 

cggcaggata atcaggttgg 20 

<210> 52 
<211> 505
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: coding for
antisense RNA-fragment of E. coli codA gene

<400> 52 

gaattcggct aacagtgtcg aataacgctt tacaaacaat tattaacgcc cggttaccag 60 

gcgaagaggg gctgtggcag attcatctgc aggacggaaa aatcagcgcc attgatgcgc 120 

aatccggcgt gatgcccata actgaaaaca gcctggatgc cgaacaaggt ttagttatac 180 

cgccgtttgt ggagccacat attcacctgg acaccacgca aaccgccgga caaccgaact 240 

ggaatcagtc cggcacgctg tttgaaggca ttgaacgctg ggccgagcgc aaagcgttat 300 

taacccatga cgatgtgaaa caacgcgcat ggcaaacgct gaaatggcag attgccaacg 360 

gcattcagca tgtgcgtacc catgtcgatg tttcggatgc aacgctaact gcgctgaaag 420 

caatgctgga agtgaagcag gaagtcgcgc cgtggattga tctgcaaatc gtcgccttcc 480 

ctcaggaagg gattttgtcg gatcc 505 

<210> 53 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 



<400> 53 

gtcaacgtaa ccaaccctgc 20



<210> 54 
<211> 26 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 54 

ggatccgaca aaatcccttc ctgagg 26 



<210> 55 
<211> 5674 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: vector
construct pBluKS-nitP-STLS1-35S-T

<400> 55 

ccagcttttg ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc 60 

tgtttcctgt gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca 120 

taaagtgtaa agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct 180 

cactgcccgc tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac 240 

gcgcggggag aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc 300 

tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt 360 

tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg 420 

ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg 480 

agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat 540 

accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta 600 

ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct 660 

gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc 720 

ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa 780 

gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg 840 

taggcggtgc tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag 900 

tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt 960 

gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 1020 

cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 1080 

agtggaacga aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca 1140 

cctagatcct tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa 1200 

cttggtctga cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat 1260 

ttcgttcatc catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct 1320 

taccatctgg ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt 1380 

tatcagcaat aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat 1440 

ccgcctccat ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta 1500 

atagtttgcg caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg 1560 

gtatggcttc attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt 1620 

tgtgcaaaaa agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg 1680 

cagtgttatc actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg 1740 

taagatgctt ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc 1800 

ggcgaccgag ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa 1860 

ctttaaaagt gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac 1920 

cgctgttgag atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt 1980 

ttactttcac cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg 2040 

gaataagggc gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa 2100 

gcatttatca gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 2160 

aacaaatagg ggttccgcgc acatttcccc gaaaagtgcc acctgacgcg ccctgtagcg 2220 

gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg 2280 

ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc 2340 

cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc 2400 

tcgaccccaa aaaacttgat tagggtgatg gttcacgtag tgggccatcg ccctgataga 2460 

cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa 2520

ctggaacaac actcaaccct atctcggtct attcttttga tttataaggg attttgccga 2580

tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca 2640

aaatattaac gcttacaatt tccattcgcc attcaggctg cgcaactgtt gggaagggcg 2700

atcggtgcgg gcctcttcgc tattacgcca gctggcgaaa gggggatgtg ctgcaaggcg 2760

attaagttgg gtaacgccag ggttttccca gtcacgacgt tgtaaaacga cggccagtga 2820

attgtaatac gactcactat agggcgaatt ggagctcgtc gagaccagat gttttacact 2880

tgaccgtaaa tgagcacccg aagaaaccgg tcacattcat ttcgaaggtg gagaaagcgg 2940

aagatgactc aaacaagtaa tcggttgtga ttcgtcagtt catgtcactc ctatgaagga 3000

gtcaagttca aaatgttatg ttgagtttca aacttttatg ctaaactttt tttctttatt 3060

ttcgttaata atggaagaga accaattctc ttgtatctaa agattatcca tctatcatcc 3120

aatttgagtg ttcaattctg gatgttgtgt taccctacat tctacaacca tgtagccaat 3180

tattatgaat ctggctttga tttcagttgt gttcttttct tttttttctt tgcatatttg 3240

catttagaat gtttaataat taagttactg tatttccaca tacattagtt ccaagaatat 3300

acatatatta atttattttt cttaaaaatg ttttggaatg actaatattg acaacgaaaa 3360

tagaagctat gctaaaccat tacgtatatg tgacttcaca tgttgttgtt ttacattccc 3420

tatatatatg gatggctgtc acaatcagaa acgtgatcga aaaaagacaa acagtgtttg 3480

cataaaaaga ctatttcgtt tcattgacaa tttgtgttta tttgtaaaga aaagtggcaa 3540

agtggaattt gagttcctgc aagtaagaaa gatgaaataa aagacttgag tgtgtgtttt 3600

tttcttttat ctgaaagctg caatgaaata ttcctaccaa gcccgtttga ttattaattg 3660

gggtttggtt ttcttgatgc gaactaattg gttatataag aaactataca atccatgtta 3720

attcaaaaat tttgatttct cttgtaggaa tatgatttac tatatgagac tttcttttcg 3780

ccaataatag taaatccaaa gatatttgac cggaccaaaa cacattgatc tattttttag 3840

tttatttaat ccagtttctc tgagataatt cattaaggaa aacttagtat taacccatcc 3900

taagattaaa taggagccaa actcacattt caaatattaa ataacataaa atggatttaa 3960

aaaatctata cgtcaaattt tatttatgac atttcttatt taaatttata tttaatgaaa 4020

tacagctaag acaaaccaaa aaaaaaatac tttctaagtg gtccaaaaca tcaattccgt 4080

tcaatattat taggtagaat cgtacgacca aaaaaaggta ggttaatacg aattagaaac 4140

atatctataa catagtatat attattacct attatgagga atcaaaatgc atcaaatatg 4200

gatttaagga atccataaaa gaataaattc tacgggaaaa aaaatggaat aaattctttt 4260

aagtttttta tttgtttttt atttggtagt tctccatttt gttttatttc gtttggattt 4320

attgtgtcca aatactttgt aaaccaccgt tgtaattctt aaacggggtt ttcacttctt 4380

ttttatattc agacataaag catcggctgg tttaatcaat caatagattt tatttttctt 4440

ctcaattatt agtaggtttg atgtgaactt tacaaaaaaa acaaaaacaa atcaatgcag 4500

agaaaagaaa ccacgtgggc tagtcccacc ttgtttcatt tccaccacag gttcgatctt 4560

cgttaccgtc tccaatagga aaataaacgt gaccacaaaa aaaaaacaaa aaaaagtcta 4620

tatattgctt ctctcaagtc tctgagtgtc atgaaccaaa gtaaaaaaca aagactcgac 4680

ctgcaggcat gcaagcttat cgtcgactac gtaagtttct gcttctacct ttgatatata 4740

tataataatt atcattaatt agtagtaata taatatttca aatatttttt tcaaaataaa 4800

agaatgtagt atatagcaat tgcttttctg tagtttataa gtgtgtatat tttaatttat 4860

aacttttcta atatatgacc aaaatttgtt gatgtgcagg tatcaccgga tccatcgaat 4920

tcggtacgct gaaatcacca gtctctctct acaaatctat ctctctctat tttctccata 4980

aataatgtgt gagtagtttc ccgataaggg gaanttaggg ttcttatagg gtttcgctca 5040

tgtgttgagc atataagaaa cccttagtat gtatttgtat ttgtaaaata cttctatcaa 5100

taaaatttct aattcctaaa accaaaatcc agtactaaaa tccagatctc ctaaagtccc 5160

tatagatctt tgtcgtgaat ataaaccaga cacgagacga ctaaacctgg agcccagacg 5220

ccgttcgaag ctagaagtac cgcttaggca ggaggccgtt agggaaaaga tgctaaggca 5280

gggttggtta cgttgactcc cccgtaggtt tggtttaaat atgatgaagt ggacggaagg 5340

aaggaggaag acaaggaagg ataaggttgc aggccctgtg caaggtaaga agatggaaat 5400

ttgatagagg tacgctacta tacttatact atacgctaag ggaatgcttg tatttatacc 5460

ctataccccc taataacccc ttatcaattt aagaaataat ccgcataagc ccccgcttaa 5520

aaattggtat cagagccatg aataggtcta tgaccaaaac tcaagaggat aaaacctcac 5580

caaaatacga aagagttctt aactctaaag ataaaagatc tttcaagatc aaaactagtt 5640

ccctcacacc ggtgacgggg atcgcgatgg gtac 5674



<210> 56 
<211> 6046 
<212> DNA 

<213> Artificial sequence 

<220>

<223> Description of the artificial sequence: binary
vector pSUN1



<400> 56 
ttccatggac 
ttattctaat 
gatagtttaa 
tgattacgcc 
cgggctcgag 
ggtaataatt 
gacaagtatc 
ttgtctaagc 
ccccgaccgg 
ggtgttgacc 
ccggagcggg 
caccccggca 
agaggcggct 
cgaggaagtg 
cgaggccgac 
caccaggacg 
gcggccgggt 
ctggccggtt 
accgagcgcc 
gtcgctgcgt 
ttatcgctgt 
cccgcgccct 
gtgcccgcga 
gcccgacgat 
gagcgcccca 
ttccggtgca 
agcagcgcat 
tcaaaggcac 
ttcttgagtc 
ccgttcttga 
aaattaaatc 
cacgctaagt 
ggcagacacg 
gctgaagatg 
cgcgcagcta 
aaggaggcgg 
gtggaggaac 
gcactggaac 
acaaatcggc 
ccagcggcaa 
tcgaatccgc 
gcccaagggc 
cgatagtcgc 
tggcgaggtg 
cggcatggcc 
atccatgaac 
cgttgcggac 
ggtagaaacc 
caagaacggc 
cgtaaagagc 
ccgcgagatc 
gatcgatccc 
agaagccaga 
gaagttctgt 
gaaggaggag 
gggcgaagca 
aggggaaaaa 
gccgtacatt 
gtcacacatg 
tttaaaactt 
cacagccgaa 



atacaaatgg 
aaacgctctt 
actgaaggcg 
aagcttgcat 
gtaccgagct 
gtcattagat 
aaacggatgt 
gtcaatttgt 
cagctcggca 
gtgttcgccg 
cgcgaggccg 
cagatcgcgc 
gcactgcttg 
acgcccaccg 
gccctggcgg 
gccaggacga 
acgtgttcga 
tgtctgatgc 
gccgtctaaa 
atatgatgcg 
acttaaccag 
gcaactcgcc 
ttgggcggcc 
tgaccgcgac 
ggcggcggac 
gccaagccct 
tgaggtcacg 
gcgcatcggc 
ccgtatcacg 
atcagaaccc 
aaaactcatt 
gccggccgtc 
ccagccatga 
tacgcggtac 
ccagagtaaa 
catggaaaat 
gggcggttgg 
ccccaagccc 
gcggcgctgg 
cgcatcgagg 
aaagaatccc 
gacgagcaac 
agcatcatgg 
atccgctacg 
agtgtgtggg 
cgataccggg 
gtactcaagt 
tgcattcggt 
cgcctggtga 
gaaaccgggc 
acagaaggca 
ggcatcggcc 
tggttgttca 
ttcaccgtgc 
gcggggcagg 
tccgccggtt 
ggtcgaaaag 
gggaaccgga 
taagtgactg 
attaaaactc 
gagctgcaaa 



acgaacggat 
ttctcttagg 
ggaaacgaca 
gcctgcaggt 
cgaattcact 
tgtttttatg 
taattcagta 
ttacaccaca 
caaaatcacc 
gcattgccga 
ccaaggcccg 
acgcccgcga 
gcgtgcatcg 
aggccaggcg 
ccgccgagaa 
accgtttttc 
gccgcccgcg 
caagctggcg 
aaggtgatgt 
atgagtaaat 
aaaggcgggt 
ggggccgatg 
gtgcgggaag 
gtgaaggcca 
ttggctgtgt 
tacgacatat 
gatggaaggc 
ggtgaggttg 
cagcgcgtga 
gagggcgacg 
tgagttaatg 
cgagcgcacg 
agcgggtcaa 
gccaaggcaa 
tgagcaaatg 
caagaacaac 
ccaggcgtaa 
gaggaatcgg 
gtgatgacct 
cagaagcacg 
ggcaaccgcc 
cagatttttt 
acgtggccgt 
agcttccaga 
attacgacct 
aagggaaggg 
tctgccggcg 
taaacaccac 
cggtatccga 
ggccggagta 
agaacccgga 
gttttctcta 
agacgatcta 
gcaagctgat 
ctggcccgat 
cctaatgtac 
gtctctttcc 
acccgtacat 
atataaaaga 
ttaaaacccg 
aagcgcctac 



aaaccttttc 
tttacccgcc 
atcagatcta 
cgactctaga 
ggccgtcgtt 
catagatgca 
cattaaagac 
atatatcctg 
acgcgttacc 
gttcgagcgt 
aggcgtgaag 
gctgatcgac 
ctcgaccctg 
gcgcggtgcc 
tgaacgccaa 
attaccgaag 
cacgtctcaa 
gcctggccgg 
gtatttgagt 
aaacaaatac 
caggcaagac 
ttctgttagt 
atcaaccgct 
tcggccggcg 
ccgcgatcaa 
gggccaccgc 
tacaagcggc 
ccgaggcgct 
gctacccagg 
ctgcccgcga 
aggtaaagag 
cagcagcaag 
ctttcagttg 
gaccattacc 
aataaatgag 
caggcaccga 
gcggctgggt 
cgtgagcggt 
ggtggagaag 
ccccggtgaa 
ggcagccggt 
cgttccgatg 
tttccgtctg 
cgggcacgta 
ggtactgatg 
agacaagccc 
agccgatggc 
gcacgttgcc 
gggtgaagcc 
catcgagatc 
cgtgctgacg 
ccgcctggca 
cgaacgcagt 
cgggtcaaat 
cctagtcatg 
ggagcagatg 
tgtggatagc 
tgggaaccca 
gaaaaaaggc 
cctggcctgt 
ccttcggtcg 
acgccctttt
aatatatcct 
gtaggaaaca 
ctagtggatc 
ttacaacgac 
ctcgaaatca 
gtccgcaatg 
ccaccagcca 
accacgccgg 
tccctaatca 
tttggccccc 
caggaaggcc 
taccgcgcac 
ttccgtgagg 
gaggaacaag 
agatcgaggc 
ccgtgcggct 
ccagcttggc 
aaaacagctt 
gcaaggggaa 
gaccatcgca 
cgattccgat 
aaccgttgtc 
cgacttcgta 
ggcagccgac 
cgacctggtg 
ctttgtcgtg 
ggccgggtac 
cactgccgcc 
ggtccaggcg 
aaaatgagca 
gctgcaacgt 
ccggcggagg 
gagctgctat 
tagatgaatt 
cgccgtggaa 
tgtctgccgg 
cgcaaaccat 
ttgaaggccg 
tcgtggcaag 
gcgccgtcga 
ctctatgacg 
tcgaagcgtg 
gaggtttccg 
gcggtttccc 
ggccgcgtgt 
ggaaagcaga 
atgcagcgta 
ttgattagcc 
gagctagctg 
gttcaccccg 
cgccgcgccg 
ggcagcgccg 
gacctgccgg 
cgctaccgca 
ctagggcaaa 
acgtacattg 
aagccgtaca 
gatttttccg 
gcataactgt 
ctgcgctccc 



aaatatccga 60 
gtcaaacact 120 
gctatgacca 180 
cgatatcgcc 240 
tcagctgctt 300 
gccaatttta 360 
tgttattaag 420 
gccaacagct 480 
ccggccgcat 540 
tcgaccgcac 600 
gccctaccct 660 
gcaccgtgaa 720 
ttgagcgcag 780 
acgcattgac 840 
catgaaaccg 900 
ggagatgatc 960 
gcatgaaatc 1020 
cgctgaagaa 1080 
gcgtcatgcg 1140 
cgcatgaagg 1200 
acccatctag 1260 
ccccagggca 1320 
ggcatcgacc 1380 
gtgatcgacg 1440 
ttcgtgctga 1500 
gagctggtta 1560 
tcgcgggcga 1620 
gagctgccca 1680 
gccggcacaa 1740 
ctggccgctg 1800 
aaagcacaaa 1860 
tggccagcct 1920 
atcacaccaa 1980 
ctgaatacat 2040 
ttagcggcta 2100 
tgccccatgt 2160 
ccctgcaatg 2220 
ccggcccggt 2280 
cgcaggccgc 2340 
cggccgctga 2400 
ttaggaagcc 2460 
tgggcacccg 2520 
accgacgagc 2580 
cagggccggc 2640 
atctaaccga 2700 
tccgtccaca 2760 
aagacgacct 2820 
cgaagaaggc 2880 
gctacaagat 2940 
attggatgta 3000 
attacttttt 3060 
caggcaaggc 3120 
gagagttcaa 3180 
agtacgattt 3240 
acctgatcga 3300 
ttgccctagc 3360 
ggaacccaaa 3420 
ttgggaaccg 3480 
cctaaaactc 3540 
ctggccagcg 3600 
tacgccccgc 3660 




cgcttcgcgt cggcctatcg cggccgctgg ccgctcaaaa atggctggcc tacggccagg 3720 
caatctacca gggcgcggac aagccgcgcc gtcgccactc gaccgccggc gcccacatca 3780 
aggcaccctg cctcgcgcgt ttcggtgatg acggtgaaaa cctctgacac atgcagctcc 3840 
cggagacggt cacagcttgt ctgtaagcgg atgccgggag cagacaagcc cgtcagggcg 3900 
cgtcagcggg tgttggcggg tgtcggggcg cagccatgac ccagtcacgt agcgatagcg 3960 
gagtgtatac tggcttaact atgcggcatc agagcagatt gtactgagag tgcaccatat 4020 
gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc gctcttccgc 4080 
ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca 4140 
ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg 4200 
agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca 4260 
taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa 4320 
cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc 4380 
tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc 4440 
gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct 4500 
gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg 4560 
tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag 4620 
gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt ggcctaacta 4680 
cggctacact agaaggacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg 4740 
aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt 4800 
tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt 4860 
ttctacgggg tctgacgctc agtggaacga aaactcacgt taagggattt tggtcatgca 4920 
tgatatatct cccaatttgt gtagggctta ttatgcacgc ttaaaaataa taaaagcaga 4980 
cttgacctga tagtttggct gtgagcaatt atgtgcttag tgcatctaac gcttgagtta 5040 
agccgcgccg cgaagcggcg tcggcttgaa cgaatttcta gctagacatt atttgccgac 5100 
taccttggtg atctcgcctt tcacgtagtg gacaaattct tccaactgat ctgcgcgcga 5160 
ggccaagcga tcttcttctt gtccaagata agcctgtcta gcttcaagta tgacgggctg 5220 
atactgggcc ggcaggcgct ccattgccca gtcggcagcg acatccttcg gcgcgatttt 5280 
gccggttact gcgctgtacc aaatgcggga caacgtaagc actacatttc gctcatcgcc 5340 
agcccagtcg ggcggcgagt tccatagcgt taaggtttca tttagcgcct caaatagatc 5400 
ctgttcagga accggatcaa agagttcctc cgccgctgga cctaccaagg caacgctatg 5460 
ttctcttgct tttgtcagca agatagccag atcaatgtcg atcgtggctg gctcgaagat 5520 
acctgcaaga atgtcattgc gctgccattc tccaaattgc agttcgcgct tagctggata 5580 
acgccacgga atgatgtcgt cgtgcacaac aatggtgact tctacagcgc ggagaatctc 5640 
gctctctcca ggggaagccg aagtttccaa aaggtcgttg atcaaagctc gccgcgttgt 5700 
ttcatcaagc cttacggtca ccgtaaccag caaatcaata tcactgtgtg gcttcaggcc 5760 
gccatccact gcggagccgt acaaatgtac ggccagcaac gtcggttcga gatggcgctc 5820 
gatgacgcca actacctctg atagttgagt cgatacttcg gcgatcaccg cttcccccat 5880 
gatgtttaac tttgttttag ggcgactgcc ctgctgcgta acatcgttgc tgctccataa 5940 
catcaaacat cgacccacgg cgtaacgcgc ttgctgcttg gatgcccgag gcatagactg 6000 
taccccaaaa aaacagtcat aacaagccat gaaaaccgcc actgcg 6046 
<400> 57

cgaattcact ggccgtcgtt ttacaacgac tcagctgctt ggtaataatt gtcattagat 60 

tgtttttatg catagatgca ctcgaaatca gccaatttta gacaagtatc aaacggatgt 120 

taattcagta cattaaagac gtccgcaatg tgttattaag ttgtctaagc gtcaatttgt 180 

ttacaccaca atatatcctg ccaccagcca gccaacagct ccccgaccgg cagctcggca 240 

caaaatcacc acgcgttacc accacgccgg ccggccgcat ggtgttgacc gtgttcgccg 300 

gcattgccga gttcgagcgt tccctaatca tcgaccgcac ccggagcggg cgcgaggccg 360 

ccaaggcccg aggcgtgaag tttggccccc gccctaccct caccccggca cagatcgcgc 420 

acgcccgcga gctgatcgac caggaaggcc gcaccgtgaa agaggcggct gcactgcttg 480 

gcgtgcatcg ctcgaccctg taccgcgcac ttgagcgcag cgaggaagtg acgcccaccg 540 

aggccaggcg gcgcggtgcc ttccgtgagg acgcattgac cgaggccgac gccctggcgg 600 

ccgccgagaa tgaacgccaa gaggaacaag catgaaaccg caccaggacg gccaggacga 660 

accgtttttc attaccgaag agatcgaggc ggagatgatc gcggccgggt acgtgttcga 720 
gccgcccgcg cacgtctcaa ccgtgcggct
caagctggcg gcctggccgg ccagcttggc 
aaggtgatgt gtatttgagt aaaacagctt 
atgagtaaat aaacaaatac gcaaggggaa 
aaaggcgggt caggcaagac gaccatcgca 
ggggccgatg ttctgttagt cgattccgat 
gtgcgggaag atcaaccgct aaccgttgtc 
gtgaaggcca tcggccggcg cgacttcgta 
ttggctgtgt ccgcgatcaa ggcagccgac 
tacgacatat gggccaccgc cgacctggtg 
gatggaaggc tacaagcggc ctttgtcgtg 
ggtgaggttg ccgaggcgct ggccgggtac 
cagcgcgtga gctacccagg cactgccgcc 
gagggcgacg ctgcccgcga ggtccaggcg 
tgagttaatg aggtaaagag aaaatgagca 
cgagcgcacg cagcagcaag gctgcaacgt 
agcgggtcaa ctttcagttg ccggcggagg 
gccaaggcaa gaccattacc gagctgctat 
tgagcaaatg aataaatgag tagatgaatt 
caagaacaac caggcaccga cgccgtggaa 
ccaggcgtaa gcggctgggt tgtctgccgg 
gaggaatcgg cgtgagcggt cgcaaaccat 
gtgatgacct ggtggagaag ttgaaggccg 
cagaagcacg ccccggtgaa tcgtggcaag 
ggcaaccgcc ggcagccggt gcgccgtcga 
cagatttttt cgttccgatg ctctatgacg 
acgtggccgt tttccgtctg tcgaagcgtg 
agcttccaga cgggcacgta gaggtttccg 
attacgacct ggtactgatg gcggtttccc 
aagggaaggg agacaagccc ggccgcgtgt 
tctgccggcg agccgatggc ggaaagcaga 
taaacaccac gcacgttgcc atgcagcgta 
cggtatccga gggtgaagcc ttgattagcc 
ggccggagta catcgagatc gagctagctg 
agaacccgga cgtgctgacg gttcaccccg 
gttttctcta ccgcctggca cgccgcgccg 
agacgatcta cgaacgcagt ggcagcgccg 
gcaagctgat cgggtcaaat gacctgccgg 
ctggcccgat cctagtcatg cgctaccgca 
cctaatgtac ggagcagatg ctagggcaaa 
gtctctttcc tgtggatagc acgtacattg 
acccgtacat tgggaaccca aagccgtaca 
atataaaaga gaaaaaaggc gatttttccg 
ttaaaacccg cctggcctgt gcataactgt 
aagcgcctac ccttcggtcg ctgcgctccc 
cggccgctgg ccgctcaaaa atggctggcc 
aagccgcgcc gtcgccactc gaccgccggc 
ttcggtgatg acggtgaaaa cctctgacac 
ctgtaagcgg atgccgggag cagacaagcc 
tgtcggggcg cagccatgac ccagtcacgt 
atgcggcatc agagcagatt gtactgagag 
gatgcgtaag gagaaaatac cgcatcaggc 
tgcgctcggt cgttcggctg cggcgagcgg 
tatccacaga atcaggggat aacgcaggaa 
ccaggaaccg taaaaaggcc gcgttgctgg 
agcatcacaa aaatcgacgc tcaagtcaga 
accaggcgtt tccccctgga agctccctcg 
ccggatacct gtccgccttt ctcccttcgg 
gtaggtatct cagttcggtg taggtcgttc 
ccgttcagcc cgaccgctgc gccttatccg 
gacacgactt atcgccactg gcagcagcca 
taggcggtgc tacagagttc ttgaagtggt 
tatttggtat ctgcgctctg ctgaagccag 



gcatgaaatc ctggccggtt tgtctgatgc 780 
cgctgaagaa accgagcgcc gccgtctaaa 840 
gcgtcatgcg gtcgctgcgt atatgatgcg 900 
cgcatgaagg ttatcgctgt acttaaccag 960 
acccatctag cccgcgccct gcaactcgcc 1020 
ccccagggca gtgcccgcga ttgggcggcc 1080 
ggcatcgacc gcccgacgat tgaccgcgac 1140 
gtgatcgacg gagcgcccca ggcggcggac 1200 
ttcgtgctga ttccggtgca gccaagccct 1260 
gagctggtta agcagcgcat tgaggtcacg 1320 
tcgcgggcga tcaaaggcac gcgcatcggc 1380 
gagctgccca ttcttgagtc ccgtatcacg 1440 
gccggcacaa ccgttcttga atcagaaccc 1500 
ctggccgctg aaattaaatc aaaactcatt 1560 
aaagcacaaa cacgctaagt gccggccgtc 1620 
tggccagcct ggcagacacg ccagccatga 1680 
atcacaccaa gctgaagatg tacgcggtac 1740 
ctgaatacat cgcgcagcta ccagagtaaa 1800 
ttagcggcta aaggaggcgg catggaaaat 1860 
tgccccatgt gtggaggaac gggcggttgg 1920 
ccctgcaatg gcactggaac ccccaagccc 1980 
ccggcccggt acaaatcggc gcggcgctgg 2040 
cgcaggccgc ccagcggcaa cgcatcgagg 2100 
cggccgctga tcgaatccgc aaagaatccc 2160 
ttaggaagcc gcccaagggc gacgagcaac 2220 
tgggcacccg cgatagtcgc agcatcatgg 2280 
accgacgagc tggcgaggtg atccgctacg 2340 
cagggccggc cggcatggcc agtgtgtggg 2400 
atctaaccga atccatgaac cgataccggg 2460 
tccgtccaca cgttgcggac gtactcaagt 2520 
aagacgacct ggtagaaacc tgcattcggt 2580 
cgaagaaggc caagaacggc cgcctggtga 2640 
gctacaagat cgtaaagagc gaaaccgggc 2700 
attggatgta ccgcgagatc acagaaggca 2760 
attacttttt gatcgatccc ggcatcggcc 2820 
caggcaaggc agaagccaga tggttgttca 2880 
gagagttcaa gaagttctgt ttcaccgtgc 2940 
agtacgattt gaaggaggag gcggggcagg 3000 
acctgatcga gggcgaagca tccgccggtt 3060 
ttgccctagc aggggaaaaa ggtcgaaaag 3120 
ggaacccaaa gccgtacatt gggaaccgga 3180 
ttgggaaccg gtcacacatg taagtgactg 3240 
cctaaaactc tttaaaactt attaaaactc 3300 
ctggccagcg cacagccgaa gagctgcaaa 3360 
tacgccccgc cgcttcgcgt cggcctatcg 3420 
tacggccagg caatctacca gggcgcggac 3480 
gcccacatca aggcaccctg cctcgcgcgt 3540 
atgcagctcc cggagacggt cacagcttgt 3600 
cgtcagggcg cgtcagcggg tgttggcggg 3660 
agcgatagcg gagtgtatac tggcttaact 3720 
tgcaccatat gcggtgtgaa ataccgcaca 3780 
gctcttccgc ttcctcgctc actgactcgc 3840 
tatcagctca ctcaaaggcg gtaatacggt 3900 
agaacatgtg agcaaaaggc cagcaaaagg 3960 
cgtttttcca taggctccgc ccccctgacg 4020 
ggtggcgaaa cccgacagga ctataaagat 4080 
tgcgctctcc tgttccgacc ctgccgctta 4140 
gaagcgtggc gctttctcat agctcacgct 4200 
gctccaagct gggctgtgtg cacgaacccc 4260 
gtaactatcg tcttgagtcc aacccggtaa 4320 
ctggtaacag gattagcaga gcgaggtatg 4380 
ggcctaacta cggctacact agaaggacag 4440 
ttaccttcgg aaaaagagtt ggtagctctt 4500 
gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta 4560
cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc 4620 
agtggaacga aaactcacgt taagggattt tggtcatgca tgatatatct cccaatttgt 4680 
gtagggctta ttatgcacgc ttaaaaataa taaaagcaga cttgacctga tagtttggct 4740 
gtgagcaatt atgtgcttag tgcatctaac gcttgagtta agccgcgccg cgaagcggcg 4800 
tcggcttgaa cgaatttcta gctagacatt atttgccgac taccttggtg atctcgcctt 4860 
tcacgtagtg gacaaattct tccaactgat ctgcgcgcga ggccaagcga tcttcttctt 4920 
gtccaagata agcctgtcta gcttcaagta tgacgggctg atactgggcc ggcaggcgct 4980 
ccattgccca gtcggcagcg acatccttcg gcgcgatttt gccggttact gcgctgtacc 5040 
aaatgcggga caacgtaagc actacatttc gctcatcgcc agcccagtcg ggcggcgagt 5100 
tccatagcgt taaggtttca tttagcgcct caaatagatc ctgttcagga accggatcaa 5160 
agagttcctc cgccgctgga cctaccaagg caacgctatg ttctcttgct tttgtcagca 5220 
agatagccag atcaatgtcg atcgtggctg gctcgaagat acctgcaaga atgtcattgc 5280 
gctgccattc tccaaattgc agttcgcgct tagctggata acgccacgga atgatgtcgt 5340 
cgtgcacaac aatggtgact tctacagcgc ggagaatctc gctctctcca ggggaagccg 5400 
aagtttccaa aaggtcgttg atcaaagctc gccgcgttgt ttcatcaagc cttacggtca 5460 
ccgtaaccag caaatcaata tcactgtgtg gcttcaggcc gccatccact gcggagccgt 5520 
acaaatgtac ggccagcaac gtcggttcga gatggcgctc gatgacgcca actacctctg 5580 
atagttgagt cgatacttcg gcgatcaccg cttcccccat gatgtttaac tttgttttag 5640 
ggcgactgcc ctgctgcgta acatcgttgc tgctccataa catcaaacat cgacccacgg 5700 
cgtaacgcgc ttgctgcttg gatgcccgag gcatagactg taccccaaaa aaacagtcat 5760 
aacaagccat gaaaaccgcc actgcgttcc atggacatac aaatggacga acggataaac 5820 
cttttcacgc ccttttaaat atccgattat tctaataaac gctcttttct cttaggttta 5880 
cccgccaata tatcctgtca aacactgata gtttaaactg aaggcgggaa acgacaatca 5940 
gatctagtag gaaacagcta tgaccatgat tacgccaagc ttgcatgcct gcaggtcgac 6000 
tctagactag tggatccgat atcgcccggg ctcgaggtac ccatcgcgat ccccgtcacc 6060 
ggtgtgaggg aactagtttt gatcttgaaa gatcttttat ctttagagtt aagaactctt 6120 
tcgtattttg gtgaggtttt atcctcttga gttttggtca tagacctatt catggctctg 6180 
ataccaattt ttaagcgggg gcttatgcgg attatttctt aaattgataa ggggttatta 6240 
gggggtatag ggtataaata caagcattcc cttagcgtat agtataagta tagtagcgta 6300 
cctctatcaa atttccatct tcttaccttg cacagggcct gcaaccttat ccttccttgt 6360 
cttcctcctt ccttccgtcc acttcatcat atttaaacca aacctacggg ggagtcaacg 6420 
taaccaaccc tgccttagca tcttttccct aacggcctcc tgcctaagcg gtacttctag 6480 
cttcgaacgg cgtctgggct ccaggtttag tcgtctcgtg tctggtttat attcacgaca 6540 
aagatctata gggactttag gagatctgga ttttagtact ggattttggt tttaggaatt 6600 
agaaatttta ttgatagaag tattttacaa atacaaatac atactaaggg tttcttatat 6660 
gctcaacaca tgagcgaaac cctataagaa ccctaanttc cccttatcgg gaaactactc 6720 
acacattatt tatggagaaa atagagagag atagatttgt agagagagac tggtgatttc 6780 
agcgtaccga attcggctaa cagtgtcgaa taacgcttta caaacaatta ttaacgcccg 6840 
gttaccaggc gaagaggggc tgtggcagat tcatctgcag gacggaaaaa tcagcgccat 6900 
tgatgcgcaa tccggcgtga tgcccataac tgaaaacagc ctggatgccg aacaaggttt 6960 
agttataccg ccgtttgtgg agccacatat tcacctggac accacgcaaa ccgccggaca 7020 
accgaactgg aatcagtccg gcacgctgtt tgaaggcatt gaacgctggg ccgagcgcaa 7080 
agcgttatta acccatgacg atgtgaaaca acgcgcatgg caaacgctga aatggcagat 7140 
tgccaacggc attcagcatg tgcgtaccca tgtcgatgtt tcggatgcaa cgctaactgc 7200 
gctgaaagca atgctggaag tgaagcagga agtcgcgccg tggattgatc tgcaaatcgt 7260 
cgccttccct caggaaggga ttttgtcgga tccggtgata cctgcacatc aacaaatttt 7320 
ggtcatatat tagaaaagtt ataaattaaa atatacacac ttataaacta cagaaaagca 7380 
attgctatat actacattct tttattttga aaaaaatatt tgaaatatta tattactact 7440 
aattaatgat aattattata tatatatcaa aggtagaagc agaaacttac gtagtcgacg 7500 
acaaaatccc ttcctgaggg aaggcgacga tttgcagatc aatccacggc gcgacttcct 7560 
gcttcacttc cagcattgct ttcagcgcag ttagcgttgc atccgaaaca tcgacatggg 7620 
tacgcacatg ctgaatgccg ttggcaatct gccatttcag cgtttgccat gcgcgttgtt 7680 
tcacatcgtc atgggttaat aacgctttgc gctcggccca gcgttcaatg ccttcaaaca 7740 
gcgtgccgga ctgattccag ttcggttgtc cggcggtttg cgtggtgtcc aggtgaatat 7800 
gtggctccac aaacggcggt ataactaaac cttgttcggc atccaggctg ttttcagtta 7860 
tgggcatcac gccggattgc gcatcaatgg cgctgatttt tccgtcctgc agatgaatct 7920 
gccacagccc ctcttcgcct ggtaaccggg cgttaataat tgtttgtaaa gcgttattcg 7980 
acactgttag ccaagcttgc atgcctgcag gtcgagtctt tgttttttac tttggttcat 8040 
gacactcaga gacttgagag aagcaatata tagacttttt tttgtttttt ttttgtggtc 8100 
acgtttattt tcctattgga gacggtaacg aagatcgaac ctgtggtgga aatgaaacaa 8160 
ggtgggacta gcccacgtgg tttcttttct ctgcattgat ttgtttttgt tttttttgta 8220 
aagttcacat caaacctact aataattgag aagaaaaata aaatctattg attgattaaa 8280 
ccagccgatg
acggtggttt 
actaccaaat 
gaatttattc 
aggtaataat 
tggtcgtacg 
gtattttttt 
gtcataaata 
aaatgtgagt 
aattatctca 
gtcaaatatc 
ttcctacaag 
caattagttc 
tatttcattg 
tttcttactt 
ttgtcaatga 
ttctgattgt 
catatacgta 
catttttaag 
cagtaactta 
acaactgaaa 
acacaacatc 
gagaattggt 
tgaaactcaa 
tcacaaccga 
ccggtttctt 



ctttatgtct 
acaaagtatt 
aaaaaacaaa 
ttttatggat 
atatactatg 
attctaccta 
tttggtttgt 
aaatttgacg 
ttggctccta 
gagaaactgg 
tttggattta 
agaaatcaaa 
gcatcaagaa 
cagctttcag 
gcaggaactc 
aacgaaatag 
gacagccatc 
atggtttagc 
aaaaataaat 
attattaaac 
tcaaagccag 
cagaattgaa 
tctcttccat 
cataacattt 
ttacttgttt 
cgggtgctca 



gaatataaaa 
tggacacaat 
taaaaaactt 
tccttaaatc 
ttatagatat 
ataatattga 
cttagctgta 
tatagatttt 
tttaatctta 
attaaataaa 
ctattattgg 
atttttgaat 
aaccaaaccc 
ataaaagaaa 
aaattccact 
tctttttatg 
catatatata 
atagcttcta 
taatatatgt 
attctaaatg 
attcataata 
cactcaaatt 
tattaacgaa 
tgaacttgac 
gagtcatctt 
tttacggtca 



aagaagtgaa 
aaatccaaac 
aaaagaattt 
catatttgat 
gtttctaatt 
acggaattga 
tttcattaaa 
ttaaatccat 
ggatgggtta 
ctaaaaaata 
cgaaaagaaa 
taacatggat 
caattaataa 
aaaacacaca 
ttgccacttt 
caaacactgt 
gggaatgtaa 
ttttcgttgt 
atattcttgg 
caaatatgca 
attggctaca 
ggatgataga 
aataaagaaa 
tccttcatag 
ccgctttctc 
agtgtaaaac 




aaccccgttt 

gaaataaaac 

attccatttt 

gcattttgat 

cgtattaacc 

tgttttggac 

tataaattta 

tttatgttat 

atactaagtt 

gatcaatgtg 

gtctcatata 

tgtatagttt 

tcaaacgggc 

ctcaagtctt 

tctttacaaa 

ttgtcttttt 

aacaacaaca 

caatattagt 

aactaatgta 

aagaaaaaaa 

tggttgtaga 

tggataatct 

aaaagtttag 

gagtgacatg 

caccttcgaa 

atctggtctc 



aagaattaca 
aaaatggaga 
ttttcccgta 
tcctcataat 
tacctttttt 
cacttagaaa 
aataagaaat 
ttaatatttg 
ttccttaatg 
ttttggtccg 
gtaaatcata 
cttatataac 
ttggtaggaa 
ttatttcatc 
taaacacaaa 
tcgatcacgt 
tgtgaagtca 
cattccaaaa 
tgtggaaata 
agaaaagaac 
atgtagggta 
ttagatacaa 
cataaaagtt 
aactgacgaa 
atgaatgtga 
gacgagct 



8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9838 



<210> 58 
<211> 14184 
<212> DNA 

<213> Artificial sequence 

<220>

<223> Description of the artificial sequence: Expression
vector pSUN1-codA-RNAi-At.Act.-2-At.Als-R-ocsT



<400> 58 

ctgcttggta 

attttagaca 

attaagttgt 

acagctcccc 

ccgcatggtg 

ccgcacccgg 

taccctcacc 

cgtgaaagag 

gcgcagcgag 

attgaccgag 

aaaccgcacc 

atgatcgcgg 

gaaatcctgg 

gaagaaaccg 

catgcggtcg 

tgaaggttat 

atctagcccg 

agggcagtgc 

tcgaccgccc 

tcgacggagc 

tgctgattcc 

tggttaagca 

gggcgatcaa 

tgcccattct 

gcacaaccgt 

ccgctgaaat 



ataattgtca 
agtatcaaac 
ctaagcgtca 
gaccggcagc 
ttgaccgtgt 
agcgggcgcg 
ccggcacaga 
gcggctgcac 
gaagtgacgc 
gccgacgccc 
aggacggcca 
ccgggtacgt 
ccggtttgtc 
agcgccgccg 
ctgcgtatat 
cgctgtactt 
cgccctgcaa 
ccgcgattgg 
gacgattgac 
gccccaggcg 
ggtgcagcca 
gcgcattgag 
aggcacgcgc 
tgagtcccgt 
tcttgaatca 
taaatcaaaa 



ttagattgtt 
ggatgttaat 
atttgtttac 
tcggcacaaa 
tcgccggcat 
aggccgccaa 
tcgcgcacgc 
tgcttggcgt 
ccaccgaggc 
tggcggccgc 
ggacgaaccg 
gttcgagccg 
tgatgccaag 
tctaaaaagg 
gatgcgatga 
aaccagaaag 
ctcgccgggg 
gcggccgtgc 
cgcgacgtga 
gcggacttgg 
agcccttacg 
gtcacggatg 
atcggcggtg 
atcacgcagc 
gaacccgagg 
ctcatttgag 



tttatgcata 
tcagtacatt 
accacaatat 
atcaccacgc 
tgccgagttc 
ggcccgaggc 
ccgcgagctg 
gcatcgctcg 
caggcggcgc 
cgagaatgaa 
tttttcatta 
cccgcgcacg 
ctggcggcct 
tgatgtgtat 
gtaaataaac 
gcgggtcagg 
ccgatgttct 
gggaagatca 
aggccatcgg 
ctgtgtccgc 
acatatgggc 
gaaggctaca 
aggttgccga 
gcgtgagcta 
gcgacgctgc 
ttaatgaggt 
gatgcactcg
aaagacgtcc 
atcctgccac 
gttaccacca 
gagcgttccc 
gtgaagtttg 
atcgaccagg 
accctgtacc 
ggtgccttcc 
cgccaagagg 
ccgaagagat 
tctcaaccgt 
ggccggccag 
ttgagtaaaa 
aaatacgcaa 
caagacgacc 
gttagtcgat 
accgctaacc 
ccggcgcgac 
gatcaaggca 
caccgccgac 
agcggccttt 
ggcgctggcc 
cccaggcact 
ccgcgaggtc 
aaagagaaaa 
aaatcagcca
gcaatgtgtt 
cagccagcca 
cgccggccgg 
taatcatcga 
gcccccgccc 
aaggccgcac 
gcgcacttga 
gtgaggacgc 
aacaagcatg 
cgaggcggag 
gcggctgcat 
cttggccgct 
cagcttgcgt 
ggggaacgca 
atcgcaaccc 
tccgatcccc 
gttgtcggca 
ttcgtagtga 
gccgacttcg 
ctggtggagc 
gtcgtgtcgc 
gggtacgagc 
gccgccgccg 
caggcgctgg 
tgagcaaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 



cacaaacacg 
cagcctggca 
caccaagctg 
atacatcgcg 
cggctaaagg 
ccatgtgtgg 
gcaatggcac 
cccggtacaa 
ggccgcccag 
cgctgatcga 
gaagccgccc 
cacccgcgat 
acgagctggc 
gccggccggc 
aaccgaatcc 
tccacacgtt 
cgacctggta 
gaaggccaag 
caagatcgta 
gatgtaccgc 
ctttttgatc 
caaggcagaa 
gttcaagaag 
cgatttgaag 
gatcgagggc 
cctagcaggg 
cccaaagccg 
gaaccggtca 
aaactcttta 
ccagcgcaca 
ccccgccgct 
gccaggcaat 
acatcaaggc 
agctcccgga 
agggcgcgtc 
atagcggagt 
ccatatgcgg 
ttccgcttcc 
agctcactca 
catgtgagca 
tttccatagg 
gcgaaacccg 
ctctcctgtt 
cgtggcgctt 
caagctgggc 
ctatcgtctt 
taacaggatt 
taactacggc 
cttcggaaaa 
tttttttgtt 
gatcttttct 
catgcatgat 
agcagacttg 
gagttaagcc 
gccgactacc 
gcgcgaggcc 
gggctgatac 
gattttgccg 
atcgccagcc 
tagatcctgt 
gctatgttct 
gaagatacct 
tggataacgc 



ctaagtgccg 
gacacgccag 
aagatgtacg 
cagctaccag 
aggcggcatg 
aggaacgggc 
tggaaccccc 
atcggcgcgg 
cggcaacgca 
atccgcaaag 
aagggcgacg 
agtcgcagca 
gaggtgatcc 
atggccagtg 
atgaaccgat 
gcggacgtac 
gaaacctgca 
aacggccgcc 
aagagcgaaa 
gagatcacag 
gatcccggca 
gccagatggt 
ttctgtttca 
gaggaggcgg 
gaagcatccg 
gaaaaaggtc 
tacattggga 
cacatgtaag 
aaacttatta 
gccgaagagc 
tcgcgtcggc 
ctaccagggc 
accctgcctc 
gacggtcaca 
agcgggtgtt 
gtatactggc 
tgtgaaatac 
tcgctcactg 
aaggcggtaa 
aaaggccagc 
ctccgccccc 
acaggactat 
ccgaccctgc 
tctcatagct 
tgtgtgcacg 
gagtccaacc 
agcagagcga 
tacactagaa 
agagttggta 
tgcaagcagc 
acggggtctg 
atatctccca 
acctgatagt 
gcgccgcgaa 
ttggtgatct 
aagcgatctt 
tgggccggca 
gttactgcgc 
cagtcgggcg 
tcaggaaccg 
cttgcttttg 
gcaagaatgt 
cacggaatga 



gccgtccgag 
ccatgaagcg 
cggtacgcca 
agtaaatgag 
gaaaatcaag 
ggttggccag 
aagcccgagg 
cgctgggtga 
tcgaggcaga 
aatcccggca 
agcaaccaga 
tcatggacgt 
gctacgagct 
tgtgggatta 
accgggaagg 
tcaagttctg 
ttcggttaaa 
tggtgacggt 
ccgggcggcc 
aaggcaagaa 
tcggccgttt 
tgttcaagac 
ccgtgcgcaa 
ggcaggctgg 
ccggttccta 
gaaaaggtct 
accggaaccc 
tgactgatat 
aaactcttaa 
tgcaaaaagc 
ctatcgcggc 
gcggacaagc 
gcgcgtttcg 
gcttgtctgt 
ggcgggtgtc 
ttaactatgc 
cgcacagatg 
actcgctgcg 
tacggttatc 
aaaaggccag 
ctgacgagca 
aaagatacca 
cgcttaccgg 
cacgctgtag 
aaccccccgt 
cggtaagaca 
ggtatgtagg 
ggacagtatt 
gctcttgatc 
agattacgcg 
acgctcagtg 
atttgtgtag 
ttggctgtga 
gcggcgtcgg 
cgcctttcac 
cttcttgtcc 
ggcgctccat 
tgtaccaaat 
gcgagttcca 
gatcaaagag 
tcagcaagat 
cattgcgctg 
tgtcgtcgtg 



cgcacgcagc 
ggtcaacttt 
aggcaagacc 
caaatgaata 
aacaaccagg 
gcgtaagcgg 
aatcggcgtg 
tgacctggtg 
agcacgcccc 
accgccggca 
ttttttcgtt 
ggccgttttc 
tccagacggg 
cgacctggta 
gaagggagac 
ccggcgagcc 
caccacgcac 
atccgagggt 
ggagtacatc 
cccggacgtg 
tctctaccgc 
gatctacgaa 
gctgatcggg 
cccgatccta 
atgtacggag 
ctttcctgtg 
gtacattggg 
aaaagagaaa 
aacccgcctg 
gcctaccctt 
cgctggccgc 
cgcgccgtcg 
gtgatgacgg 
aagcggatgc 
ggggcgcagc 
ggcatcagag 
cgtaaggaga 
ctcggtcgtt 
cacagaatca 
gaaccgtaaa 
tcacaaaaat 
ggcgtttccc 
atacctgtcc 
gtatctcagt 
tcagcccgac 
cgacttatcg 
cggtgctaca 
tggtatctgc 
cggcaaacaa 
cagaaaaaaa 
gaacgaaaac 
ggcttattat 
gcaattatgt 
cttgaacgaa 
gtagtggaca 
aagataagcc 
tgcccagtcg 
gcgggacaac 
tagcgttaag 
ttcctccgcc 
agccagatca 
ccattctcca 
cacaacaatg 
agcaaggctg 
cagttgccgg 
attaccgagc 
aatgagtaga 
caccgacgcc 
ctgggttgtc 
agcggtcgca 
gagaagttga 
ggtgaatcgt 
gccggtgcgc 
ccgatgctct 
cgtctgtcga 
cacgtagagg 
ctgatggcgg 
aagcccggcc 
gatggcggaa 
gttgccatgc 
gaagccttga 
gagatcgagc 
ctgacggttc 
ctggcacgcc 
cgcagtggca 
tcaaatgacc 
gtcatgcgct 
cagatgctag 
gatagcacgt 
aacccaaagc 
aaaggcgatt 
gcctgtgcat 
cggtcgctgc 
tcaaaaatgg 
ccactcgacc 
tgaaaacctc 
cgggagcaga 
catgacccag 
cagattgtac 
aaataccgca 
cggctgcggc 
ggggataacg 
aaggccgcgt 
cgacgctcaa 
cctggaagct 
gcctttctcc 
tcggtgtagg 
cgctgcgcct 
ccactggcag 
gagttcttga 
gctctgctga 
accaccgctg 
ggatctcaag 
tcacgttaag 
gcacgcttaa 
gcttagtgca 
tttctagcta 
aattcttcca 
tgtctagctt 
gcagcgacat 
gtaagcacta 
gtttcattta 
gctggaccta 
atgtcgatcg 
aattgcagtt 
gtgacttcta 
caacgttggc 1620
cggaggatca 1680 
tgctatctga 1740 
tgaattttag 1800 
gtggaatgcc 1860 
tgccggccct 1920 
aaccatccgg 1980 
aggccgcgca 2040 
ggcaagcggc 2100 
cgtcgattag 2160 
atgacgtggg 2220 
agcgtgaccg 2280 
tttccgcagg 2340 
tttcccatct 2400 
gcgtgttccg 2460 
agcagaaaga 2520 
agcgtacgaa 2580 
ttagccgcta 2640 
tagctgattg 2700 
accccgatta 2760 
gcgccgcagg 2820 
gcgccggaga 2880 
tgccggagta 2940 
accgcaacct 3000 
ggcaaattgc 3060 
acattgggaa 3120 
cgtacattgg 3180 
tttccgccta 3240 
aactgtctgg 3300 
gctccctacg 3360 
ctggcctacg 3420 
gccggcgccc 3480 
tgacacatgc 3540 
caagcccgtc 3600 
tcacgtagcg 3660 
tgagagtgca 3720 
tcaggcgctc 3780 
gagcggtatc 3840 
caggaaagaa 3900 
tgctggcgtt 3960 
gtcagaggtg 4020 
ccctcgtgcg 4080 
cttcgggaag 4140 
tcgttcgctc 4200 
tatccggtaa 4260 
cagccactgg 4320 
agtggtggcc 4380 
agccagttac 4440 
gtagcggtgg 4500 
aagatccttt 4560 
ggattttggt 4620 
aaataataaa 4680 
tctaacgctt 4740 
gacattattt 4800 
actgatctgc 4860 
caagtatgac 4920 
ccttcggcgc 4980 
catttcgctc 5040 
gcgcctcaaa 5100 
ccaaggcaac 5160 
tggctggctc 5220 
cgcgcttagc 5280 
cagcgcggag 5340 




aatctcgctc tctccagggg aagccgaagt ttccaaaagg tcgttgatca aagctcgccg 5400 
cgttgtttca tcaagcctta cggtcaccgt aaccagcaaa tcaatatcac tgtgtggctt 5460 
caggccgcca tccactgcgg agccgtacaa atgtacggcc agcaacgtcg gttcgagatg 5520 
gcgctcgatg acgccaacta cctctgatag ttgagtcgat acttcggcga tcaccgcttc 5580 
ccccatgatg tttaactttg ttttagggcg actgccctgc tgcgtaacat cgttgctgct 5640 
ccataacatc aaacatcgac ccacggcgta acgcgcttgc tgcttggatg cccgaggcat 5700 
agactgtacc ccaaaaaaac agtcataaca agccatgaaa accgccactg cgttccatgg 5760 
acatacaaat ggacgaacgg ataaaccttt tcacgccctt ttaaatatcc gattattcta 5820 
ataaacgctc ttttctctta ggtttacccg ccaatatatc ctgtcaaaca ctgatagttt 5880 
aaactgaagg cgggaaacga caatcagatc tagtaggaaa cagctatgac catgattacg 5940 
ccaagcttgc atgcctgcag gtcgactcta gactagtgga tccgatatcg cccgggctcg 6000 
aggtacccat cgcgatcccc gtcaccggtg tgagggaact agttttgatc ttgaaagatc 6060 
ttttatcttt agagttaaga actctttcgt attttggtga ggttttatcc tcttgagttt 6120 
tggtcataga cctattcatg gctctgatac caatttttaa gcgggggctt atgcggatta 6180 
tttcttaaat tgataagggg ttattagggg gtatagggta taaatacaag cattccctta 6240 
gcgtatagta taagtatagt agcgtacctc tatcaaattt ccatcttctt accttgcaca 6300 
gggcctgcaa ccttatcctt ccttgtcttc ctccttcctt ccgtccactt catcatattt 6360 
aaaccaaacc tacgggggag tcaacgtaac caaccctgcc ttagcatctt ttccctaacg 6420 
gcctcctgcc taagcggtac ttctagcttc gaacggcgtc tgggctccag gtttagtcgt 6480 
ctcgtgtctg gtttatattc acgacaaaga tctataggga ctttaggaga tctggatttt 6540 
agtactggat tttggtttta ggaattagaa attttattga tagaagtatt ttacaaatac 6600 
aaatacatac taagggtttc ttatatgctc aacacatgag cgaaacccta taagaaccct 6660 
aatttccctt atcgggaaac tactcacaca ttatttatgg agaaaataga gagagataga 6720 
tttgtagaga gagactggtg atttcagcgt accgaattcg attttcggct aacagtgtcg 6780 
aataacgctt tacaaacaat tattaacgcc cggttaccag gcgaagaggg gctgtggcag 6840 
attcatctgc aggacggaaa aatcagcgcc attgatgcgc aatccggcgt gatgcccata 6900 
actgaaaaca gcctggatgc cgaacaaggt ttagttatac cgccgtttgt ggagccacat 6960 
attcacctgg acaccacgca aaccgccgga caaccgaact ggaatcagtc cggcacgctg 7020 
tttgaaggca ttgaacgctg ggccgagcgc aaagcgttat taacccatga cgatgtgaaa 7080 
caacgcgcat ggcaaacgct gaaatggcag attgccaacg gcattcagca tgtgcgtacc 7140 
catgtcgatg tttcggatgc aacgctaact gcgctgaaag caatgctgga agtgaagcag 7200 
gaagtcgcgc cgtggattga tctgcaaatc gtcgccttcc ctcaggaagg gattttgtcg 7260 
gatccggtga tacctgcaca tcaacaaatt ttggtcatat attagaaaag ttataaatta 7320 
aaatatacac acttataaac tacagaaaag caattgctat atactacatt cttttatttt 7380 
gaaaaaaata tttgaaatat tatattacta ctaattaatg ataattatta tatatatatc 7440 
aaaggtagaa gcagaaactt acgtagtcga cgacaaaatc ccgtcctgag ggaaggcgac 7500 
gatttgcaga tcaatccacg gcgcgacttc ctgcttcact tccagcattg ctttcagcgc 7560 
agttagcgtt gcatccgaaa catcgacatg ggtacgcaca tgctgaatgc cgttggcaat 7620 
ctgccatttc agcgtttgcc atgcgcgttg tttcacatcg tcatgggtta ataacgcttt 7680 
gcgctcggcc cagcgttcaa tgccttcaaa cagcgtgccg gactgattcc agttcggttg 7740 
tccggcggtt tgcgtggtgt ccaggtgaat atgtggctcc acaaacggcg gtataactaa 7800 
accttgttcg gcatccaggc tgttttcagt tatgggcatc acgccggatt gcgcatcaat 7860 
ggcgctgatt tttccgtcct gcagatgaat ctgccacagc ccctcttcgc ctggtaaccg 7920 
ggcgttaata attgtttgta aagcgttatt cgacactgtt agccaagctt gcatgcctgc 7980 
aggtcgactc tagaggatcc ccgatccact cgagtctttg ttttttactt tggttcatga 8040 
cactcagaga cttgagagaa gcaatatata gacttttttt tgtttttttt ttgtggtcac 8100 
gtttattttc ctattggaga cggtaacgaa gatcgaacct gtggtggaaa tgaaacmagg 8160 
tgggactagc ccacgtggtt tcttttctct gcattgattt gtttttgttt tttytgtaaa 8220 
gttcacatca aacctactaa taattgagaa gaaaaataaa atctattgat tgattaaacc 8280 
agccgatgct ttatgtctga atataaaaaa gaagtgaaaa ccccgtttaa gaattacaac 8340 
ggtggtttac aaagtatttg gacacaataa atccaaacga aataaaacaa aatggagaac 8400
taccaaataa aaaacaaata aaaaacttaa aagaatttat tccatttttt ttcccgtaga 8460 
atttattctt ttatggattc cttaaatcca tatttgatgc attttgattc ctcataatag 8520 
gtaataatat atactatgtt atagatatgt ttctaattcg tattaaccta cctttttttg 8580 
gtcgtacgat tctacctaat aatattgaac ggaattgatg ttttggacca cttagaaagt 8640 
attttttttt tggtttgtct tagctgtatt tcattaaata taaatttaaa taagaaatgt 8700 
cataaataaa atttgacgta tagatttttt aaatccattt tatgttattt aatatttgaa 8760 
atgtgagttt ggctcctatt taatcttagg atgggttaat actaagtttt ccttaatgaa 8820 
ttatctcaga gaaactggat taaataaact aaaaaataga tcaatgtgtt ttggtccggt 8880 
caaatatctt tggatttact attattggcg aaaagaaagt ctcatatagt aaatcatatt 8940 
cctacaagag aaatcaaaat ttttgaatta acatggattg tatagtttct tatataacca 9000 
attagttcgc atcaagaaaa ccaaacccca attaataatc aaacgggctt ggtaggaata 9060 
tttcattgca gctttcagat aaaagaaaaa aacacacact caagtctttt atttcatctt 9120 
tcttacttgc aggaactcaa attccacttt
gtcaatgaaa cgaaatagtc tttttatgca 
ctgattgtga cagccatcca tatatatagg 
tatacgtaat ggtttagcat agcttctatt 
tttttaagaa aaataaatta atatatgtat 
gtaacttaat tattaaacat tctaaatgca 
aactgaaatc aaagccagat tcataataat 
acaacatcca gaattgaaca ctcaaattgg 
gaattggttc tcttccatta ttaacgaaaa 
aaactcaaca taacattttg aacttgactc 
acaaccgatt acttgtttga gtcatcttcc 
ggtttcttcg ggtgctcatt tacggtcaag 
accgaatcga agtacaactt agctcttgct 
taccgagctc gaattcactg gccgtcgttt 
aaaatttaga acgaacttaa ttatgatctc 
taggttatca ttatgtaaga aagttttgac 
atgtaattgg tatctcaact caacattata 
aaacaactat tttttatgta tgcaagagtc 
tgacgagttc ggatgtagta gtagccatta 
atatgatgaa acattgtatc ttattgtata 
ttctttcacg gtctgaatta attatgatac 
tgaattgtat gaaatctaat tgaacaagcc 
tgactcggtt taagttaacc actaaaaaaa 
aggtcacagt catgaagcca tcaaagcaaa 
tagtttaaaa attagttaac acgagggaaa 
ttacctgtgg tcgaaatgat tcgtgtctgt 
aataaagttg taagagataa acccgcctat 
aattgtctcg ttgtcctcct cactttcatc 
agaagaacaa ggaagaagac taagagagaa 
cgttttgaat cttcctcaat ctcatcttct 
tttctggatc tactttattt gctggatctc 
ggaattcgtt taatttggat ctgtgaacct 
atctaagttg accgatcagt tagctcgatt 
ggagagatcc atgttcatgt tacctgggaa 
ctgttgaagt tagattgaat ctgaacactg 
aaggttagat gaagtttgtg tatagattct 
ttgaacagaa agctatttct gattcaatca 
gtgtgtttgc agctcataaa aaaaacgcga 
aacaacaaca tcttcttcga tctccttctc 
accattacca atctccagat tctccctccc 
ctcctcccgc cgccgcggta tcaaatccag 
cacaaccacc aatgtcacaa ccactccctc 
catctcccga ttcgctccag atcaaccccg 
agaacgtcaa ggcgtagaaa ccgtattcgc 
ccaagcctta acccgctctt cctcaatccg 
tgtattcgca gcagaaggat acgctcgatc 
ttcaggtccc ggagctacaa atctcgttag 
tcctcttgta gcaatcacag gacaagtccc 
agagactccg attgttgagg taacgcgttc 
tgttgaagat atccctagga ttattgagga 
tggacctgtt ttggttgatg ttcctaaaga 
ggaacaggct atgagattac ctggttatat 
tcatttggag cagattgtta ggttgatttc 
tggtggttgt ttgaattcta gcgatgaatt 
tgttgcgagt acgttgatgg ggctgggatc 
tatgcttgga atgcatggga ctgtgtatgc 
gttggcgttt ggggtaaggt ttgatgatcg 
tagggctaag attgttcata ttgatattga 
tcatgtgtct gtgtgtggtg atgttaagct 
gaaccgagcg gaggagctta agcttgattt 
gaaacagaag tttccgttga gctttaagac 
gattaaggtc cttgatgagt tgactgatgg 
acatcaaatg tgggcggcgc agttctacaa 



sequence listing 
gccacttttc tttacaaata aacacaaatt 9180 
aacactgttt gtcttttttc gatcacgttt 9240 
gaatgtaaaa caacaacatg tgaagtcaca 9300 
ttcgttgtca atattagtca ttccaaaaca 9360 
attcttggaa ctaatgtatg tggaaataca 9420 
aatatgcaaa gaaaaaaaag aaaagaacac 9480 
tggctacatg gttgtagaat gtagggtaac 9540 
atgatagatg gataatcttt agatacaaga 9600 
taaagaaaaa aagtttagca taaaagtttg 9660 
cttcatagga gtgacatgaa ctgacgaatc 9720 
gctttctcca ccttcgaaat gaatgtgacc 9780 
tgtaaaacat ctggtctcga gtaatgtcca 9840 
acatcaccaa gatcttgatg ggggatcggg 9900 
tacaacgact cagcacgcgt tggtttcgac 9960 
aaatacattg atacatatct catctagatc 10020 
gaatatggca cgacaaaatg gctagactcg 10080 
cttataccaa acattagtta gacaaaattt 10140 
agcatatgta taattgattc agaatcgttt 10200 
tttaatgtac atactaatcg tgaatagtga 10260 
aatatccata aacacatcat gaaagacact 10320 
aattctaata gaaaacgaat taaattacgt 10380 
aaccacgacg acgactaacg ttgcctggat 10440 
cggagctgtc atgtaacacg cggatcgagc 10500 
agaactaatc caagggctga gatgattaat 10560 
aggctgtctg acagccaggt cacgttatct 10620 
cgattttaat tatttttttg aaaggccgaa 10680 
ataaattcat atattttcct ctccgctttg 10740 
agccgttttg aatctccggc gacttgacag 10800 
agtaagagat aatccaggag attcattctc 10860 
tccgctcttt ctttccaagg taataggaac 10920 
gatcttgttt tctcaatttc cttgagatct 10980 
ccactaaatc ttttggtttt actagaatcg 11040 
atagctacca gaatttggct tgaccttgat 11100 
atgatttgta tatgtgaatt gaaatctgaa 11160 
tcaatgttag attgaatctg aacactgttt 11220 
tcgaaacttt aggatttgta gtgtcgtacg 11280 
gggtttattt gactgtattg aactcttttt 11340 
acctgcaggc atggcggcgg caacaacaac 11400 
caccaaacca tctccttcct cctccaaatc 11460 
attctcccta aaccccaaca aatcatcctc 11520 
ctctccctcc tccatctccg ccgtgctcaa 11580 
tccaaccaaa cctaccaaac ccgaaacatt 11640 
caaaggcgct gatatcctcg tcgaagcttt 11700 
ttaccctgga ggtgcatcaa tggagattca 11760 
taacgtcctt cctcgtcacg aacaaggagg 11820 
ctcaggtaaa ccaggtatct gtatagccac 11880 
cggattagcc gatgcgttgt tagatagtgt 11940 
tcgtcgtatg attggtacag atgcgtttca 12000 
gattacgaag cataactatc ttgtgatgga 12060 
agctttcttt ttagctactt ctggtagacc 12120 
tattcaacaa cagcttgcga ttcctaattg 12180 
gtctaggatg cctaaacctc cggaagattc 12240 
tgagtctaag aagcctgtgt tgtatgttgg 12300 
gggtaggttt gttgagctta cggggatccc 12360 
ttatccttgt gatgatgagt tgtcgttaca 12420 
aaattacgct gtggagcata gtgatttgtt 12480 
tgtcacgggt aagcttgagg cttttgctag 12540 
ctcggctgag attgggaaga ataagactcc 12600 
ggctttgcaa gggatgaata aggttcttga 12660 
tggagtttgg aggaatgagt tgaacgtaca 12720 
gtttggggaa gctattcctc cacagtatgc 12780 
aaaagccata ataagtactg gtgtcgggca 12840 
ttacaagaaa ccaaggcagt ggctatcatc 12900 
aggaggcctt ggagctatgg gatttggact tcctgctgcg attggagcgt ctgttgctaa 12960 

ccctgatgcg atagttgtgg atattgacgg agatggaagc tttataatga atgtgcaaga 13020 

gctagccact attcgtgtag agaatcttcc agtgaaggta cttttattaa acaaccagca 13080 

tcttggcatg gttatgcaat gggaagatcg gttctacaaa gctaaccgag ctcacacatt 13140 

tctcggggat ccggctcagg aggacgagat attcccgaac atgttgctgt ttgcagcagc 13200 

ttgcgggatt ccagcggcga gggtgacaaa gaaagcagat ctccgagaag ctattcagac 13260 

aatgctggat acaccaggac cttacctgtt ggatgtgatt tgtccgcacc aagaacatgt 13320 

gttgccgatg atcccgaatg gtggcacttt caacgatgtc ataacggaag gagatggccg 13380 

gattaaatac tgagagatga aaccggcctg gccggcccgg agtggggagg cacgatggcc 13440 

gctttggtcg atcgacggga tcgatcctgc tttaatgaga tatgcgagac gcctatgatc 13500 

gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa aaacctgagc atgtgtagct 13560 

cagatcctta ccgccggttt cggttcattc taatgaatat atcacccgtt actatcgtat 13620 

ttttatgaat aatattctcc gttcaattta ctgattgtac cctactactt atatgtacaa 13680 

tattaaaatg aaaacaatat attgtgctga ataggtttat agcgacatct atgatagagc 13740 

gccacaataa caaacaattg cgttttatta ttacaaatcc aattttaaaa aaagcggcag 13800 

aaccggtcaa acctaaaaga ctgattacat aaatcttatt caaatttcaa aaggccccag 13860 

gggctagtat ctacgacaca ccgagcggcg aactaataac gttcactgaa gggaactccg 13920 

gttccccgcc ggcgcgcatg ggtgagattc cttgaagttg agtattggcc gtccgctcta 13980 

ccgaaagtta cgggcaccat tcaacccggt ccagcacggc ggccgggtaa ccgacttgct 14040 

gccccgagaa ttatgcagca tttttttggt gtatgtgggc cccaaatgaa gtgcaggtca 14100 

aaccttgaca gtgacgacaa atcgttgggc gggtccaggg cgaattttgc gacaacatgt 14160 

cgaggctcag caggatgggc ccag 14184 



<210> 59 

<211> 1011 

<212> DNA 

<213> Zea mays 

<220> 

<221> CDS 

<222> (1)..(981) 

<223> coding for 5-methylthioribose kinase 
<400> 59 

gca cga gca ctc ctc tcc tct cct ctc gcc ggc gca tcg ccc gac tgt 48 
Ala Arg Ala Leu Leu Ser Ser Pro Leu Ala Gly Ala Ser Pro Asp Cys 
1 5 10 15 

cag tca gcc tca gcc atg gcc gcg gag gag gag cag ggc ttc cgc ccg 96 
Gln Ser Ala Ser Ala Met Ala Ala Glu Glu Glu Gln Gly Phe Arg Pro 
20 25 30 

ctg gac gag tcg tcc ctg ctc gcc tac atc aag gcc acg ccg gcg ctc 144 
Leu Asp Glu Ser Ser Leu Leu Ala Tyr Ile Lys Ala Thr Pro Ala Leu 
35 40 45 

gcc tcc cgc ctc ggc ggc ggt ggc agt cta gac tcc atc gag atc aag 192 
Ala Ser Arg Leu Gly Gly Gly Gly Ser Leu Asp Ser Ile Glu Ile Lys 
50 55 60 

gag gtc ggc gac ggc aac ctc aac ttc gtc tac atc gtg cag tcc gag 240 
Glu Val Gly Asp Gly Asn Leu Asn Phe Val Tyr Ile Val Gln Ser Glu 
65 70 75 80 

gcc ggc gcc atc gtc gtc aag cag gcg ctc ccg tac gtg cgc tgc gtg 288 
Ala Gly Ala Ile Val Val Lys Gln Ala Leu Pro Tyr Val Arg Cys Val 
85 90 95 

ggg gat tcg tgg ccc atg acg cgg gag cgc gcc tac ttc gag gcc tcc 336 
Gly Asp Ser Trp Pro Met Thr Arg Glu Arg Ala Tyr Phe Glu Ala Ser 
100 105 110 

acg ctg cgg gag cac ggc cgc ctg tgc ccg gag cac acc ccc gag gtg 384 
Thr Leu Arg Glu His Gly Arg Leu Cys Pro Glu His Thr Pro Glu Val 
115 120 125 

tac cac ttc gac cgg acc ttg tcg ctg atg ggg atg cgc tac atc gag 432 
Tyr His Phe Asp Arg Thr Leu Ser Leu Met Gly Met Arg Tyr Ile Glu 
130 135 140 

ccc ccg cac atc atc ctc cgc aag ggc ctc gtc gcc ggt gtc gag tac 480 
Pro Pro His Ile Ile Leu Arg Lys Gly Leu Val Ala Gly Val Glu Tyr 
145 150 155 160 

ccg ctg ctc gcc gac cac atg tcc gat tac atg gcc aag acg ctc ttc 528 
Pro Leu Leu Ala Asp His Met Ser Asp Tyr Met Ala Lys Thr Leu Phe 
165 170 175 

ttc acc tcc ctc ctc tat aac aat acc acg gat cat aag aac gga gtt 576 
Phe Thr Ser Leu Leu Tyr Asn Asn Thr Thr Asp His Lys Asn Gly Val 
180 185 190 

gct aag tac tct gcg aac gtg gag atg tgt agg ctc acg gag caa gtt 624 
Ala Lys Tyr Ser Ala Asn Val Glu Met Cys Arg Leu Thr Glu Gln Val 
195 200 205 



gtg ttc tcg gac cca tac cgt gtt tcc aaa ttt aat cgg tgg acc tcg 672 
Val Phe Ser Asp Pro Tyr Arg Val Ser Lys Phe Asn Arg Trp Thr Ser 
210 215 220 

cct tat ctc gac aaa gat gct gag gca gtt cgc gag gat gat gag ctc 720 
Pro Tyr Leu Asp Lys Asp Ala Glu Ala Val Arg Glu Asp Asp Glu Leu 
225 230 235 240 

aag ttg gaa gta gct ggg ctg aaa tcg atg ttt atc gag aga gct caa 768 
Lys Leu Glu Val Ala Gly Leu Lys Ser Met Phe Ile Glu Arg Ala Gln 
245 250 255 

gct ctg att cat gga gat ctc cac act ggt tct atc atg gtg acc gaa 816 
Ala Leu Ile His Gly Asp Leu His Thr Gly Ser Ile Met Val Thr Glu 
260 265 270 

gtt caa ctc aag tca ttg atc cag aat ttg ggt tct atg ggg cca atg 864 
Val Gln Leu Lys Ser Leu Ile Gln Asn Leu Gly Ser Met Gly Pro Met 
275 280 285 

ggg ttt gat att ggg agc ctt cct tgg aaa cct gat ttt ggg cat act 912 
Gly Phe Asp Ile Gly Ser Leu Pro Trp Lys Pro Asp Phe Gly His Thr 
290 295 300 

atg cac aga atg ggc atg ctg atc aag cga atg atc gta agg ctt aca 960 
Met His Arg Met Gly Met Leu Ile Lys Arg Met Ile Val Arg Leu Thr 
305 310 315 320 

aga atg gat ctt gaa gac aat tgaagagtcg tggaatttgt tccacaaaaa 1011 
Arg Met Asp Leu Glu Asp Asn 
325 

<210> 60 

<211> 327 

<212> PRT 

<213> Zea mays 

<400> 60 

Ala Arg Ala Leu Leu Ser Ser Pro Leu Ala Gly Ala Ser Pro Asp Cys 
1 5 10 15 

Gln Ser Ala Ser Ala Met Ala Ala Glu Glu Glu Gln Gly Phe Arg Pro 
20 25 30 

Leu Asp Glu Ser Ser Leu Leu Ala Tyr Ile Lys Ala Thr Pro Ala Leu 
35 40 45 

Ala Ser Arg Leu Gly Gly Gly Gly Ser Leu Asp Ser Ile Glu Ile Lys 
50 55 60 

Glu Val Gly Asp Gly Asn Leu Asn Phe Val Tyr Ile Val Gln Ser Glu 
65 70 75 80 

Ala Gly Ala Ile Val Val Lys Gln Ala Leu Pro Tyr Val Arg Cys Val 
85 90 95 

Gly Asp Ser Trp Pro Met Thr Arg Glu Arg Ala Tyr Phe Glu Ala Ser 
100 105 110 

Thr Leu Arg Glu His Gly Arg Leu Cys Pro Glu His Thr Pro Glu Val 
115 120 125 

Tyr His Phe Asp Arg Thr Leu Ser Leu Met Gly Met Arg Tyr Ile Glu 
130 135 140 

Pro Pro His Ile Ile Leu Arg Lys Gly Leu Val Ala Gly Val Glu Tyr 
145 150 155 160 

Pro Leu Leu Ala Asp His Met Ser Asp Tyr Met Ala Lys Thr Leu Phe 
165 170 175 

Phe Thr Ser Leu Leu Tyr Asn Asn Thr Thr Asp His Lys Asn Gly Val 
180 185 190 

Ala Lys Tyr Ser Ala Asn Val Glu Met Cys Arg Leu Thr Glu Gln Val 
195 200 205 

Val Phe Ser Asp Pro Tyr Arg Val Ser Lys Phe Asn Arg Trp Thr Ser 
210 215 220 

Pro Tyr Leu Asp Lys Asp Ala Glu Ala Val Arg Glu Asp Asp Glu Leu 
225 230 235 240 

Lys Leu Glu Val Ala Gly Leu Lys Ser Met Phe Ile Glu Arg Ala Gln 
245 250 255 

Ala Leu Ile His Gly Asp Leu His Thr Gly Ser Ile Met Val Thr Glu 
260 265 270 

Val Gln Leu Lys Ser Leu Ile Gln Asn Leu Gly Ser Met Gly Pro Met 
275 280 285 

Gly Phe Asp Ile Gly Ser Leu Pro Trp Lys Pro Asp Phe Gly His Thr 
290 295 300 

Met His Arg Met Gly Met Leu Ile Lys Arg Met Ile Val Arg Leu Thr 
305 310 315 320 

Arg Met Asp Leu Glu Asp Asn 
325 

<210> 61 
<211> 471 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (2)..(469) 

<223> coding for 5-methylthioribose kinase 
<400> 61 

a ttt ccg ggt cga cga ttt cgt ggc aat ctc aac ttc gtt ttc atc gtc 49 
Phe Pro Gly Arg Arg Phe Arg Gly Asn Leu Asn Phe Val Phe Ile Val 
1 5 10 15 

atc gga tcc act ggc tca ctc gtc atc aaa cag gcg ctt ccg tat ata 97 
Ile Gly Ser Thr Gly Ser Leu Val Ile Lys Gln Ala Leu Pro Tyr Ile 
20 25 30 

cgt tgt att ggg gag tct tgg cca atg acg aaa gaa aga gct tac ttt 145 
Arg Cys Ile Gly Glu Ser Trp Pro Met Thr Lys Glu Arg Ala Tyr Phe 
35 40 45 

gaa gct aca act ctg aga aag cac gga gct ttg tct cct gat cat gtt 193 
Glu Ala Thr Thr Leu Arg Lys His Gly Ala Leu Ser Pro Asp His Val 
50 55 60 

cct gaa gtc tac cat ttt gac agg acc atg gct ttg att gga atg agg 241 
Pro Glu Val Tyr His Phe Asp Arg Thr Met Ala Leu Ile Gly Met Arg 
65 70 75 80 

tat ctg gag cct cct cac atc atc ctc cgc aaa gga ctc gtt gct gga 289 
Tyr Leu Glu Pro Pro His Ile Ile Leu Arg Lys Gly Leu Val Ala Gly 
85 90 95 

atc cag tac cct ttc ctt gca gaa cac atg gct gat tac atg gcc aaa 337 
Ile Gln Tyr Pro Phe Leu Ala Glu His Met Ala Asp Tyr Met Ala Lys 
100 105 110 

acc ctc ttc ttc act tcg ctc ctc tat cat gat acc aca gag cac aaa 385 
Thr Leu Phe Phe Thr Ser Leu Leu Tyr His Asp Thr Thr Glu His Lys 
115 120 125 

aga gca gta acc gag ttt tgt ggt aat gtg gag tta tgc cgg tta acg 433 
Arg Ala Val Thr Glu Phe Cys Gly Asn Val Glu Leu Cys Arg Leu Thr 
130 135 140 

gag caa gta gtg ttc tct gac ccg tat aga gtt tct ag 471 
Glu Gln Val Val Phe Ser Asp Pro Tyr Arg Val Ser 
145 150 155 

<210> 62 

<211> 156 

<212> PRT 

<213> Brassica napus 

<400> 62 

Phe Pro Gly Arg Arg Phe Arg Gly Asn Leu Asn Phe Val Phe Ile Val 
1 5 10 15 

Ile Gly Ser Thr Gly Ser Leu Val Ile Lys Gln Ala Leu Pro Tyr Ile 
20 25 30 

Arg Cys Ile Gly Glu Ser Trp Pro Met Thr Lys Glu Arg Ala Tyr Phe 
35 40 45 

Glu Ala Thr Thr Leu Arg Lys His Gly Ala Leu Ser Pro Asp His Val 
50 55 60 

Pro Glu Val Tyr His Phe Asp Arg Thr Met Ala Leu Ile Gly Met Arg 
65 70 75 80 

Tyr Leu Glu Pro Pro His Ile Ile Leu Arg Lys Gly Leu Val Ala Gly 
85 90 95 

Ile Gln Tyr Pro Phe Leu Ala Glu His Met Ala Asp Tyr Met Ala Lys 
100 105 110 

Thr Leu Phe Phe Thr Ser Leu Leu Tyr His Asp Thr Thr Glu His Lys 
115 120 125 

Arg Ala Val Thr Glu Phe Cys Gly Asn Val Glu Leu Cys Arg Leu Thr 
130 135 140 

Glu Gln Val Val Phe Ser Asp Pro Tyr Arg Val Ser 
145 150 155 

<210> 63 
<211> 415 
<212> DNA 

<213> Brassica napus 

<220> 

<221> CDS 

<222> (3)..(413) 

<223> coding for 5-methylthioribose kinase 
<400> 63 

gg gtc gac gat ttc gtg ctg aga gca aaa gag atg tcg ttc gat gag 47 
Val Asp Asp Phe Val Leu Arg Ala Lys Glu Met Ser Phe Asp Glu 
1 5 10 15 

ttc aag ccg ttg aac gag aaa tct cta gta gag tac ata aag gca acg 95 
Phe Lys Pro Leu Asn Glu Lys Ser Leu Val Glu Tyr Ile Lys Ala Thr 
20 25 30 

cct gcc ctc tcc tcc agg ctc gga gac aag tac gat gat ctg gtc atc 143 
Pro Ala Leu Ser Ser Arg Leu Gly Asp Lys Tyr Asp Asp Leu Val Ile 
35 40 45 

aag gaa gtt gga gat ggc aat ctc aac ttc gtt ttc atc gtt gtc gga 191 
Lys Glu Val Gly Asp Gly Asn Leu Asn Phe Val Phe Ile Val Val Gly 
50 55 60 

tcc act ggc tca ctc gtc atc aaa cag gcg ctt ccg tat ata cgt tgt 239 
Ser Thr Gly Ser Leu Val Ile Lys Gln Ala Leu Pro Tyr Ile Arg Cys 
65 70 75 

att gga gaa tca tgg cca atg acg aaa gaa aga gct tac ttt gaa gca 287 
Ile Gly Glu Ser Trp Pro Met Thr Lys Glu Arg Ala Tyr Phe Glu Ala 
80 85 90 95 

aca act ctg aga aag cac ggt ggt ttg tct ccg gat cat gtt cct gaa 335 
Thr Thr Leu Arg Lys His Gly Gly Leu Ser Pro Asp His Val Pro Glu 
100 105 110 

gtc tac cat ttt gac aga acc atg gct ttg att gga atg aga tac ctc 383 
Val Tyr His Phe Asp Arg Thr Met Ala Leu Ile Gly Met Arg Tyr Leu 
115 120 125 

gag cct cct cac atc atc ctc cgc aaa gga ct 415 
Glu Pro Pro His Ile Ile Leu Arg Lys Gly 
130 135 



<210> 64 
<211> 137 
<212> PRT 

<213> Brassica napus 
<400> 64 

Val Asp Asp Phe Val Leu Arg Ala Lys Glu Met Ser Phe Asp Glu Phe 
1 5 10 15 

Lys Pro Leu Asn Glu Lys Ser Leu Val Glu Tyr Ile Lys Ala Thr Pro 
20 25 30 

Ala Leu Ser Ser Arg Leu Gly Asp Lys Tyr Asp Asp Leu Val Ile Lys 
35 40 45 

Glu Val Gly Asp Gly Asn Leu Asn Phe Val Phe Ile Val Val Gly Ser 
50 55 60 

Thr Gly Ser Leu Val Ile Lys Gln Ala Leu Pro Tyr Ile Arg Cys Ile 
65 70 75 80 

Gly Glu Ser Trp Pro Met Thr Lys Glu Arg Ala Tyr Phe Glu Ala Thr 
85 90 95 

Thr Leu Arg Lys His Gly Gly Leu Ser Pro Asp His Val Pro Glu Val 
100 105 110 

Tyr His Phe Asp Arg Thr Met Ala Leu Ile Gly Met Arg Tyr Leu Glu 
115 120 125 

Pro Pro His Ile Ile Leu Arg Lys Gly 
130 135 



<210> 65 
<211> 424 
<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (3)..(422) 

<223> coding for 5-methylthioribose kinase 
<400> 65 

cc ctt ctc tac aac tcc acc act gat cac aag aaa gga gtt gct cag 47 
Leu Leu Tyr Asn Ser Thr Thr Asp His Lys Lys Gly Val Ala Gln 
1 5 10 15 

tac tgc gat aat gtg gag atg tgt agg ctc aca gag caa gtc gtg ttc 95 
Tyr Cys Asp Asn Val Glu Met Cys Arg Leu Thr Glu Gln Val Val Phe 
20 25 30 

tca gac cca tac atg ctc gcc aaa tac aat cgt tgc aca tca ccc ttc 143 
Ser Asp Pro Tyr Met Leu Ala Lys Tyr Asn Arg Cys Thr Ser Pro Phe 
35 40 45 

cta gat aat gat gct gca gcg gtt cga gag gat gct gag ctt aaa ttg 191 
Leu Asp Asn Asp Ala Ala Ala Val Arg Glu Asp Ala Glu Leu Lys Leu 
50 55 60 

gag att gct gaa ttg aaa tca atg ttt att gag aga gca cag gct ctt 239 
Glu Ile Ala Glu Leu Lys Ser Met Phe Ile Glu Arg Ala Gln Ala Leu 
65 70 75 

ctt cat gga gat ctc cac act ggt tcc atc atg gtg aca cca gat tct 287 
Leu His Gly Asp Leu His Thr Gly Ser Ile Met Val Thr Pro Asp Ser 
80 85 90 95 

act caa gtg att gat cca gaa ttt gct ttc tat ggc cca atg ggt tac 335 
Thr Gln Val Ile Asp Pro Glu Phe Ala Phe Tyr Gly Pro Met Gly Tyr 
100 105 110 

gac att ggg gcc ttc ctg ggg aac ttg att ttg gca tat ttt tca caa 383 
Asp Ile Gly Ala Phe Leu Gly Asn Leu Ile Leu Ala Tyr Phe Ser Gln 
115 120 125 

gat gga cac gct gat caa gca aat gat cgt aag gct tac aa 424 
Asp Gly His Ala Asp Gln Ala Asn Asp Arg Lys Ala Tyr 
130 135 140 

<210> 66 
<211> 140 
<212> PRT 

<213> Oryza sativa 
<400> 66 

Leu Leu Tyr Asn Ser Thr Thr Asp His Lys Lys Gly Val Ala Gln Tyr 
1 5 10 15 

Cys Asp Asn Val Glu Met Cys Arg Leu Thr Glu Gln Val Val Phe Ser 
20 25 30 

Asp Pro Tyr Met Leu Ala Lys Tyr Asn Arg Cys Thr Ser Pro Phe Leu 
35 40 45 

Asp Asn Asp Ala Ala Ala Val Arg Glu Asp Ala Glu Leu Lys Leu Glu 
50 55 60 

Ile Ala Glu Leu Lys Ser Met Phe Ile Glu Arg Ala Gln Ala Leu Leu 
65 70 75 80 

His Gly Asp Leu His Thr Gly Ser Ile Met Val Thr Pro Asp Ser Thr 
85 90 95 

Gln Val Ile Asp Pro Glu Phe Ala Phe Tyr Gly Pro Met Gly Tyr Asp 
100 105 110 

Ile Gly Ala Phe Leu Gly Asn Leu Ile Leu Ala Tyr Phe Ser Gln Asp 
115 120 125 

Gly His Ala Asp Gln Ala Asn Asp Arg Lys Ala Tyr 
130 135 140 

<210> 67 
<211> 404 
<212> DNA 
<213> Glycine max 
<220> 

<221> CDS 

<222> (3)..(404) 

<223> coding for 5-methylthioribose kinase 
<400> 67 

ta atc ccc gaa cat gtt cct gaa gtg tat cac ttt gac cgt acc atg 47 
Ile Pro Glu His Val Pro Glu Val Tyr His Phe Asp Arg Thr Met 
1 5 10 15 

tct ttg atc ggt atg cgt tac ttg gag ccc cca cat ata atc ctc ata 95 
Ser Leu Ile Gly Met Arg Tyr Leu Glu Pro Pro His Ile Ile Leu Ile 
20 25 30 

aaa ggg ttg att gct ggg att gag tac cct ttt ttg gct gaa cac atg 143 
Lys Gly Leu Ile Ala Gly Ile Glu Tyr Pro Phe Leu Ala Glu His Met 
35 40 45 

gct gat ttc atg gcg aag aca ctc ttc ttc acg tct ctg ctt ttc cgt 191 
Ala Asp Phe Met Ala Lys Thr Leu Phe Phe Thr Ser Leu Leu Phe Arg 
50 55 60 

tcc act gct gac cac aaa cgg gac gtt gcc gaa ttt tgt ggg aat gtg 239 
Ser Thr Ala Asp His Lys Arg Asp Val Ala Glu Phe Cys Gly Asn Val 
65 70 75 

gag tta tgc agg ctc act gaa cag gtc gtt ttc tct gac cct tat aaa 287 
Glu Leu Cys Arg Leu Thr Glu Gln Val Val Phe Ser Asp Pro Tyr Lys 
80 85 90 95 

gtt tct caa tat aat cgt tgg act tcc ccc tat ctt gat cgt gat gct 335 
Val Ser Gln Tyr Asn Arg Trp Thr Ser Pro Tyr Leu Asp Arg Asp Ala 
100 105 110 

gag gct gtt cgg gaa gac aat ctg ctg aag ctt gaa gtt gct gag ctg 383 
Glu Ala Val Arg Glu Asp Asn Leu Leu Lys Leu Glu Val Ala Glu Leu 
115 120 125 

aaa tcc aag ttc att gag agc 404 
Lys Ser Lys Phe Ile Glu Ser 
130 

<210> 68 
<211> 134 
<212> PRT 
<213> Glycine max 

<400> 68 

Ile Pro Glu His Val Pro Glu Val Tyr His Phe Asp Arg Thr Met Ser 
1 5 10 15 

Leu Ile Gly Met Arg Tyr Leu Glu Pro Pro His Ile Ile Leu Ile Lys 
20 25 30 

Gly Leu Ile Ala Gly Ile Glu Tyr Pro Phe Leu Ala Glu His Met Ala 
35 40 45 

Asp Phe Met Ala Lys Thr Leu Phe Phe Thr Ser Leu Leu Phe Arg Ser 
50 55 60 

Thr Ala Asp His Lys Arg Asp Val Ala Glu Phe Cys Gly Asn Val Glu 
65 70 75 80 

Leu Cys Arg Leu Thr Glu Gln Val Val Phe Ser Asp Pro Tyr Lys Val 
85 90 95 

Ser Gln Tyr Asn Arg Trp Thr Ser Pro Tyr Leu Asp Arg Asp Ala Glu 
100 105 110 

Ala Val Arg Glu Asp Asn Leu Leu Lys Leu Glu Val Ala Glu Leu Lys 
115 120 125 

Ser Lys Phe Ile Glu Ser 
130 

<210> 69 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 69 

cgtgaatacg gcgtggagtc g 



<210> 70 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 70 

cggcaggata atcaggttgg 



<210> 71 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: 
oligonucleotide primer 

<400> 71 

gtcaacgtaa ccaaccctgc 